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PREFACE 


soning underlies all analysis of computer systems. It plays a 
operations research problems and in finite probability. Two 
of the most basic mathematical aspects of computer science concern the speed and 
logical structure of a computer program. Speed involves enumeration of the number 
of times each step in a program can be performed. Logical structure involves flow 
charts, a form of graphs. Analysis of the speed and logical structure of operations 
research algorithms to optimize efficient manufacturing or garbage collection entails 
similar combinatorial mathematics. Determining the probability that one of a certain 
subset of equally likely outcomes occurs requires counting the size of the subset. 
Such combinatorial probability is the basis of many nonparametric statistical tests. 
Thus, enumeration and graph theory are used pervasively throughout the mathematical 
sciences. 

This book teaches students how to reason and model combinatorially. It seeks to 
develop proficiency in basic discrete math problem solving in the way that a calculus 
textbook develops proficiency in basic analysis problem solving. 

The three principal aspects of combinatorial reasoning emphasized in this book 
are the systematic analysis of different possibilities, the exploration of the logical 
structure of a problem (e.g., finding manageable subpieces or first solving the problem 
with three objects instead of n), and ingenuity. Although important uses of combina- 
torics in computer science, operations research, and finite probability are mentioned, 
these applications are often used solely for motivation. Numerical examples involving 
the same concepts use more interesting settings such as poker probabilities or logical 
games. 

Theory is always first motivated by examples, and proofs are given only when 
their reasoning is needed to solve applied problems. Elsewhere, results are stated 
without proof, such as the form of solutions to various recurrence relations, and then 
applied in problem solving. Occasionally, a few theorems are stated simply to give 
students a flavor of what the theory in certain areas is like. 

For decades, collegiate curriculum recommendations from the Mathematical 
Association of America have included combinatorial problem solving as an impor- 
tant component of training in the mathematical sciences. Combinatorial problem 
solving underlies a wide spectrum of important subjects in the computer science 
curriculum. Indeed, it is expected that most students in a course using this book will be 
computer science majors. For both mathematics majors and computer science majors, 
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this author believes that general reasoning skills stressed here are more important than 
mastering a variety of definitions and techniques. 

This book is designed for use by students with a wide range of ability and maturity 
(sophomores through beginning graduate students). The stronger the students, the 
harder the exercises that can be assigned. The book can be used for a one-quarter, 
two-quarter, or one-semester course depending on how much material is used. It may 
also be used for a one-quarter course in applied graph theory or a one-semester or 
one-quarter course in enumerative combinatorics (starting from Chapter 5). A typical 
one-semester undergraduate discrete methods course should cover most of Chapters 
1 to 3 and 5 to 8, with selected topics from other chapters if time permits. 

Instructors are strongly encouraged to obtain a copy of the instructor’s guide 
accompanying this book. The guide has an extensive discussion of common student 
misconceptions about particular topics, hints about successful teaching styles for this 
course, and sample course outlines (weekly assignments, tests, etc.). 

The sixth edition of this book draws upon features from all the earlier editions. 
For example, the game of Mastermind that appeared at the beginning of the first 
edition has been brought back, and a closing Postlude about cryptanalysis has been 
added. The suggested solutions to selected enumeration exercises from the second 
and third editions have returned. Of course, there are also new exercises. Also, the 
numbers were changed in many of the old exercises in the counting chapters (to guard 
against student groups accumulating old solution sets). 

Many people gave useful comments about early drafts and the first edition of 
this text; Jim Frauenthal and Doug West were especially helpful. The idea for this 
book is traceable to a combinatorics course taught by George Dantzig and George 
Polya at Stanford in 1969, a course for which I was the grader. Many instructors 
who have used earlier editions of this book have supplied me with valuable feedback 
and suggestions that have, I hope, made this edition better. I gratefully acknowledge 
my debt to them. Ultimately, my interest in combinatorial mathematics and in its 
effective teaching rests squarely on the shoulders of my father, A. W. Tucker, who 
had long sought to give finite mathematics a greater role in mathematics as well as 
in the undergraduate mathematics curriculum. Finally, special thanks go to former 
students of my combinatorial mathematics courses at Stony Brook. It was they who 
taught me how to teach this subject. 


Alan Tucker 

Stony Brook, New York 
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PRELUDE 



This book seeks to develop facility at combinatorial reasoning, which is the basis 
for analyzing a wide range of problems in computer science and discrete applied 
mathematics. As a warm-up exercise for such reasoning, this Prelude presents the 
game of Mastermind. Mastermind was introduced in the 1970s and attained such 
popularity in England that in 1975 a British National Mastermind Championship was 
held with overflow crowds. Mastermind uses the same type of combinatorial reasoning 
that underlies the mathematics in this book but uses it in a recreational setting. 

The objective of the game is to guess a secret code consisting of colored pegs. 
The secret code is a row of four pegs that may be chosen (with repeats) from the 
colors red (R), white (W), yellow (Y), green (G), blue (Bu), and black (Bk). Each 
guess of a possible secret code is scored to give some information about how close 
the guess is to the real secret code. Specifically, the player who chose the secret code 
indicates (1) how many of the code pegs in the guess are both of the right color and in 
the right position in the row, and (2) how many of the code pegs are of the right color 
(occur somewhere in the secret code) but in the wrong position. These two pieces of 
information are recorded in the form of black keys for ( 1 ) and white keys for (2). The 
game ends when the secret code has been correctly guessed, that is, a score of four 
black keys is given to a guess. The guesses in Example 1 indicate how this scoring 
procedure works. 


Example 1: Mastermind Scoring 


Secret Code 

R Bu Y Y 

Scoring 

Comments 

Guess 1 

Bu W 

GY 

•0 

A peg can receive at most one 
key; so the Y earns one black key. 

Guess 2 

Y G 

GY 

• 0 


Guess 3 

Bk G 

GW 


A null score is actually very helpful; 
it eliminates all three colors. 

Guess 4 

BuBuRR 

• 0 

There is only one blue peg in the secret 
code, and so only one blue peg earns a 
key; it gets the best key possible, black. 


xt 


xii Prelude 


This game has many good associated counting problems. How many secret codes 
are possible, how many different sets of black and white scoring keys are possible, how 
many secret codes are possible given a particular first guess and its scoring? This game 
is conceptually similar to several important problems in information classification and 
retrieval. In computer recognition of human speech, chemical compounds, or other 
complex data sets, a number of cleverly planned queries must be made about the data. 
Compilers perform a variety of sequential tests, usually in the form of binary search 
trees (discussed in Chapter 3), to identify commands in the text they are compiling. 
Many of the theoretical analyses associated with efficient recognition require exactly 
the same reasoning and techniques as the Mastermind counting problems. 

Although we justify our discussion of Mastermind in terms of related counting 
problems, the game itself provides enjoyable recreation and we encourage readers 
to play it with a friend. In the absence of a playing companion, the exercises in 
this section present games in which enough guesses have been made and scored to 
enable one to determine the secret code. The following example illustrates how these 
exercises can be analyzed. 

Example 2: Find a Secret Code 

The following guesses and scoring have been occurred. Readers should try to 
determine the secret code for themselves before reading the analysis below. 


Scoring 


Guess 1 

W 

GBuR 

•0 

Guess 2 

Bk YR 

Bu 

•0 

Guess 3 

R 

RG 

Y 

000 

Guess 4 

Y 

RR 

W 

• • 


Consider color red. Guess 3 with two reds and three scoring keys indicates that 
there must be at least one red peg in the secret code; but since the keys are white, the 
red(s) must be in position 3 or 4. However, guess 4 has only black keys and the red 
pegs are in positions 2 and 3. Since guess 3 indicates that there cannot be a red peg 
in position 2, we conclude that the secret code has exactly one red peg in position 3 
and no other red peg (or else the red peg in position 2 in Guess 1 would earn a white 
key). Note that the black key in guess 2 must also be due to the red in position 3. 

Guess 3 now implies that the green and yellow pegs must have earned two of the 
three white keys (since one of the reds received no key). Since yellow is somewhere 
in the secret code, then guess 4 tells us that yellow must be in position 1. So the 
secret code is of the form Y_R_. Moreover, yellow cannot also occur in position 2 or 
4, because it appears there in guesses 2 and 3, respectively, without earning a black 
key. So there is only one Y, as well as just one R, in the secret code. Since the keys 
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in guesses 2 and 4 are now known to be for yellow and red, the other colors in these 
two guesses — black, blue, and white — cannot be in the secret code. Then the only 
remaining color that can appear in positions 2 and 4 is green, and the secret code 
isYGRG. ■ 


More information about strategies for playing Mastermind can be found in The 
Official Mastermind Handbook by Ault [1]. 



EXERCISES 


1. Determine the secret code for 


GY R Bu oooo 

G Bu R Y #*oo 

2. Determine the two possible secret codes (one has no repeated colors) for 


Bk 

Bu 

Y 

W 

00 

R 

W 

Bu 

G 

• 00 

Bu 

G 

R 

G 

00 

Y 

R 

W 

Bu 

• 00 

1. Determine the secret code for 

Bu 

Bu 

Bk 

W 

• •0 

Y 

W 

Bk 

W 

• • 

W 

Bu 

R 

G 

•0 

Y 

Bu 

Bk 

R 

•• 


4 . Determine the two possible secret codes for 


5 . 


6 . 


R 

G 

Y 

W 

00 

Y 

Bk 

W 

Bu 

• 0 

G 

R 

R 

G 

• 0 

Bk 

Bu 

G 

R 

• 0 

w 

R 

Bk 

R 

0 

Find a fourth guess 

whose scoring 

for 





G 

Y 

Bk 

R 

•• 

Y 

Bu 

G 

W 

•0 

Bu 

W 

Y 

Y 

0 


Find a fourth guess whose scoring 
for 


will allow you to determine the secret code 


will allow you to determine the secret code 




G R Bu W 
G Y Bu G 
Y Bk Bk Y 


o 
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7. A seventh color, orange (O), has been added in this game. Determine the secret 


code for 




0 

R 

Bu 

G 

00 

Y 

0 

W 

Y 

0 

R 

Bk 

Bu 

G 

000 

0 

R 

R 

Y 

•0 

Bk 

Bu 

0 

R 

00 


8. A seventh color, orange (O), has been added. Determine the two possible secret 


codes for 



0 

R 

G 

Bu 

00 

Y 

Y 

W 

0 

•0 

R 

Bu 

Bu 

Bk 

•0 

Bk 

O 

Y 

R 

•0 

Bu 

Bk 

0 

Y 

•00 


9. A seventh color, orange (O), has been added. Determine the seven possible secret 
codes for 

O Y W Bk o 

W Bk O R *o 

G O R R • 

Y R Bu O oo 

10. There are now five positions and two extra colors: orange (O) and pink (P). 
Determine the secret code for 

Bk G G O O #*o 

W O Bk G R #*o 

O Bk P Y Bu .o 

G P Y R W oo 

11. There are now five positions and three extra colors: orange (O) pink (P), and 
violet (V). Determine the secret code for 


R V Bk G O oo 

V R V R Y .o 

Bk W W Bu Bk • 

P O P O G oo 

G Y Bu W P .oo 

W P V Bu Y .oo 

Y Bu R V Bk ooo 

12. Find the probability that your initial guess in a Mastermind game is correct. 

13. (a) What sets of four or fewer black and white keys can never occur in Master- 

mind scores? 


(b) How many different sets of black and white keys can occur in Mastermind 
scores? 
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14 . We call two guesses in Mastermind similar if one can be obtained from the other 
by a permutation of positions and/or a permutation of colors. How many different 
(nonsimilar) guesses are there? 

15 . Suppose your first guess uses four different colors and the score is four white 
keys. How many different secret codes are possible? (Note that all four-color 
guesses are similar; see Exercise 14.) 

16 . Suppose your first guess uses three different colors (one color is repeated) and 
its score is one black and three whites. How many different secret codes are 
possible? (Note that all three-color guesses are similar; see Exercise 14.) 

17 . Suppose your first guess uses at least two colors and its score is no keys. What 
is the minimum number of secret codes that are eliminated by this guess? 

18 . Consider the simplified Mastermind game in which there are four pegs, each 
of a different color, and the secret code consists of some arrangement of these 
four pegs. Develop a complete strategy for playing this game so that you can 
determine the secret code after at most three guesses (by the fourth guess, you 
get a score of four black keys). 

19 . Consider the simplified Mastermind game in which there are three positions and 
three colors of pegs. Find an optimal first guess for this game. A guess is optimal 
if for some number k, all possible scores of the guess leave at most k possible 
secret codes and some possible score of any other guess leaves at least k possible 
secret codes. 

20 . Consider the simplified Mastermind game in which there are four positions 
but only two colors of pegs. Find an optimal first guess for this game (see 
Exercise 19). 

21 . Show that no matter what the scores of these four guesses, any secret code can 
be correctly guessed (using the scores of these first four guesses) in at most two 
more guesses. 

R R W W 
R Bk R Bk 
Bu Bu G G 
Bu Y Bu Y 


22 . Write a computer program to make up secret codes and score guesses. 

23 . Write a program to play Mastermind. 
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CHAPTER 1 

ELEMENTS OF GRAPH 
THEORY 



GRAPH 


MODELS 


The first four chapters of this book deal with graphs and their applications. A graph 
G = ( V , E ) consists of a finite set V of vertices and a set E of edges joining dif- 
ferent pairs of distinct vertices.* Figure 1.1a shows a depiction of a graph with 
V — {a, b , c, d] and E — {(a, b), (a, c), (a, d ), ( b , d), (c, t/)}. We represent vertices 
with points and edges, and lines joining the prescribed pairs of vertices. This defini- 
tion of a graph does not allow two edges to join the same two vertices. Also, an edge 
cannot “loop” so that both ends terminate at the same vertex — an edge’s end vertices 
must be distinct. The two ends of an undirected edge can be written in either order, 
(b, c) or (c, b). We say that vertices a and b are adjacent when there is an edge {a, b ). 

Sometimes the edges are ordered pairs of vertices, called directed edges. In a 
directed graph, all edges are directed. See the directed graph in Figure 1.1 b. We write 
(IT, c ) to denote a directed edge from b to c. In a directed graph, we allow one edge in 
each direction between a pair of vertices. See edges {a, c ) and (c, a) in Figure 1.1 b. 

The combinatorial reasoning required in graph theory, and later in the enumer- 
ation part of this book, involves different types of analysis than are used in calculus 
and high school mathematics. There are few general rules or formulas for solving 
these problems. Instead, each question usually requires its own particular analysis. 
This analysis sometimes calls for clever model-building or creative thinking, but more 
often consists of breaking the problem into many cases (and subcases) that are easy 
enough to solve using simple logic or basic counting rules. A related line of reasoning 
is to solve a special case of the given problem and then to find ways to extend that 



* What this book calls a graph is referred to in many graph theory books as a simple graph. In general, 
graph theory terminology varies a little from book to book. 
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reasoning to all the other cases that may arise. The underlying theme here is summa- 
rized by a famous quote from the great problem-solver George Polya: “The challeng- 
ing part is asking the right questions. Then the answers are easy.” 

In graph theory, combinatorial arguments are made a little easier by the use 
of pictures of the graphs. For example, a case-by-case argument is much easier to 
construct when one can draw a graphical depiction of each case. 

Graphs have proven to be an extremely useful tool for analyzing situations in- 
volving a set of elements in which various pairs of elements are related by some 
property. The most obvious examples of graphs are sets with physical links, such 
as electrical networks, where electrical components (transistors) are the vertices and 
connecting wires are the edges; or telephone communication systems, where tele- 
phones and switching centers are the vertices and telephone lines are the edges. Road 
maps, oil pipelines, and subway systems are other examples. 

Another natural form of graphs is sets with logical or hierarchical sequenc- 
ing, such as computer flowcharts, where the instructions are the vertices and the 
logical flow from one instruction to possible successor instruction(s) defines the 
edges; or an organizational chart, where the people are the vertices and if person A 
is the immediate superior of person B , there is an edge (A” B ). Computer data struc- 
tures, evolutionary trees in biology, and the scheduling of tasks in a complex project 
are other examples. 

The emphasis in this book will be on problem solving, with problems about 
general graphs and applied graph models. Observe that we will usually not have any 
numbers to work with, only some vertices and edges. At first, this may seem to be 
highly nonmathematical. It is certainly very different from the mathematics that one 
learns in high school or in calculus courses. However, disciplines such as computer 
science and operations research contain as much graph theory as they do standard 
numerical mathematics. 

This section consists of a collection of illustrative examples about graphs. We 
will solve each problem from scratch with a little logic and systematic analysis. Many 
of these examples will be revisited in greater depth in subsequent chapters. 

The following three graph theory terms are used in the coming examples. A path 
P is a sequence of distinct vertices, written P = x \ — X 2 — • -x n , with each pair of 
consecutive vertices in P joined by an edge. If in addition there is an edge (x n , xi), the 

sequence is called a circuit, written X 1 -X 2 x f ,-X| . For example, in Figure 1.1a, 

b-d-a-c forms a path, while a-b-d-c-a forms a circuit. A graph is connected if there is 
a path between every pair of vertices. The removal of certain edges or vertices from 
a connected graph G is said to disconnect the graph if the resulting graph is no longer 
connected — that is, if at least one pair of vertices is no longer joined by a path. The 
graph in Figure 1.1a is connected, but the removal of edges {a, b) and ( b , d ) will 
disconnect it. 


Example 1: Matching 

Suppose that we have five people A, B , C, D, E and five jobs a, b, c, d, e, and that 
various people are qualified for various jobs. The problem is to find a feasible one- 
to-one matching of people to jobs, or to show that no such matching can exist. We 
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can represent this situation by a graph with vertices for each person and for each job, 
with edges joining people with jobs for which they are qualified. Does there exist a 
feasible matching of people to jobs for the graph in Figure 1.2? 

The answer is no. The reason can be found by considering people A, B, and I). 
These three people as a set are collectively qualified for only two jobs, c and d. Hence 
there is no feasible matching possible for these three people, much less all five people. 
An algorithm for finding a feasible matching, if any exists, will be presented in Chapter 
4. Such matching graphs in which all the edges go horizontally between two sets of 
vertices are called bipartite. Bipartite graphs are discussed further in Section 1.3. ■ 

Example 2: Spelling Checker 

A spelling checker looks at each word X (represented in a computer as a binary 
number) in a document and tries to match X with some word in its dictionary, which 
typically contains close to 100,000 words. To understand how this checking works, 
we consider the simplified problem of matching an unknown letter X with one of the 
26 letters in the English alphabet. In the spirit of the strategy humans use to home in on 
the page in a dictionary where a given word appears, the computer search procedure 
would first compare the unknown letter X with M, to determine whether X < M or 
X > M. The answer to this comparison locates X in the first 13 letters of the alphabet 
or the second 13 letters, thus cutting the number of possible letters for X in half. 
This strategy of cutting the possible matches in half can be continued with as many 
comparisons as needed to home in on X’s letter. For example, if X < M, then we 
could test whether or not X < G; if X > M, we could test whether X < S. 

This testing procedure is naturally represented by a directed graph called a tree. 
Figure 1 .3 shows the first three rounds of comparisons for the letter-matching pro- 
cedure. The vertices represent the different letters used in the comparisons. The left 
descending edge from a vertex Q points to the letter for the next comparison if X < Q, 
and the right descending edge from Q points to the next letter if X > Q. 

For our original spelling-checker problem, a word processor would use a similar, 
but larger, tree of comparisons. With just 12 rounds of comparisons, it could reduce 


M 



Figure 1.3 d j p w 
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the number of possible matches for an unknown word X from 100,000 down to 25, 
about the number of words in a column of a page in a dictionary. (Once a list was 
reduced to about 25 possibilities, a computer search for X would usually run linearly 
down that list, just as a human would.) ■ 

Chapter 3 examines trees and their use in various search problems. Trees can be 
characterized as graphs that are connected and that have a unique path between any 
pair of vertices (ignoring the directions of directed edges). The next example uses 
trees in a very different way. 

Example 3: Network Reliability 

Suppose the graph in Figure 1 .4 represents a network of telephone lines (or electrical 
transmission lines). We are interested in the network’s vulnerability to accidental 
disruption. We want to identify sets of those lines and switching centers that must 
stay in service to avoid disconnecting the network. 

There is no telephone line (edge) whose removal will disconnect the telephone 
network (graph). Similarly, there is no vertex whose removal disconnects the graph. 

Is there any pair of edges whose removal disconnects the graph? There are several 
such pairs. For example, we see that if the two edges incident to a are removed, vertex 
a is isolated from the rest of the network. A more interesting disconnecting pair of 
edges is ( b , c), (j, k). It is left to the reader as an exercise to find all disconnecting 
sets consisting of two edges for the graph in Figure 1.4. 

Let us take a different tack. Suppose we want to find a minimal set of edges 
needed to link together the 11 vertices in Figure 1.4. There are several possible 
minimal connecting sets of edges. By inspection, we find the following one: (a, b ), 
( b , c), ( c , d ), (d, h), (h, g), (h, k), ( k,j ), ( j,f ), (j, i ), (i, e); the edges in this minimal 
connecting set are darkened in Figure 1 .4. A minimal connecting set will always be a 
tree. One interesting general result about these sets is that if the graph G has n vertices, 
then a minimal connecting set for G (if any exists) always has n — 1 edges. ■ 

The number of edges incident to a vertex is called the degree of the vertex. 

Example 4: Street Surveillance 

Now suppose the graph in Figure 1.4 represents a section of a city’s street map. We 
want to position police officers at comers (vertices) so that they can keep every block 
(edge) under surveillance — that is, every edge should have police officers at (at least) 
one of its end vertices. What is the smallest number of police officers that can do this 
job? 

Let us try to get a lower bound on the number of police officers needed. The map 
has 14 blocks (edges). Corners b, c, e,f, h, and j each have degree 3, and corners a, d, 
g, i, and k each have degree 2. Since four vertices can be incident to at most 4x3= 12 
edges but there are 14 edges in all, we will need at least five police officers. We shall 
now try to find a set of five vertices incident to all the edges. If we can find such a set, 
we know that it is the best (smallest) solution possible. 

If all five police officers were positioned at degree-3 vertices, then 5x3=15 
edges are watched by the five police officers. Since there are only 14 edges, some 
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edge would be covered by police officers at both end vertices. If four police officers 
are at degree-3 vertices and one at a degree-2 vertex, then exactly 14 edges are 
watched — and no edge need be covered at both ends. (If fewer than four of the 
five police officers are at degree-3 vertices, we could not watch all 14 edges). With 
these general observations, we are ready for a systematic analysis to try to find five 
vertices that are collectively adjacent to all 14 edges. 

Consider edge (c, d). Suppose it is watched by an officer at vertex d. Then vertex 
c (the other end vertex of edge (c, d)) cannot also have an officer, since we noted above 
that if we use a degree-2 vertex, such as d, then no edge can be watched from both 
end vertices. However, if vertex c cannot be used, then edge (c, g) must be watched 
from its other end vertex g. But now we are using two degree-2 vertices, d and g. We 
noted above that at most one degree-2 vertex can be used. We got into this trouble by 
assuming that edge (c, d) is watched from vertex d. 

Now assume no officer is at vertex d. Then we must watch edge (c, d) with an 
officer at vertex c. Similarly, edge id, h) can be watched only by placing an officer at 
vertex h. Next look at edge (h, k). It is already watched by vertex h. Then we assert 
that ( h , k) cannot also be watched by an officer at vertex k, since k has degree-2 and 
we noted above that if we use a degree-2 vertex, no edge can be watched from both 
ends. We conclude that there cannot be an officer at vertex k. Then edge (k,j) can be 
watched only by placing an officer at vertex j. We now have officers required to be at 
vertices c, h, and j. 

Similar reasoning shows that with an officer at vertex j , there cannot be an officer 
at vertex i; then there must be an officer at vertex e; then there cannot be an officer at 
vertex a; and then there must be an officer at vertex b. In sum, we have shown that we 
should place police officers at vertices c, h, j, e, and b. A check confirms that these 
five vertices do indeed watch all 14 edges. A smallest number of police officers for 
this surveillance problem has been found. Note that since our reasoning forced us to 
use exactly these five vertices, no other set of five vertices can work. 

At the beginning of this example, we showed that at least five corners were 
needed to keep all the blocks (edges) under surveillance. Now we have produced a set 
of five corners that achieve such surveillance. It then follows that five is the minimum 
number of corners. 

We conclude this example by noting that in this surveillance situation, one can 
also consider watching the vertices rather than the edges: How few officers are needed 
to watch, that is, be at or adjacent to, all the vertices? We use the same type of 
argument as in the block surveillance problem to get a lower bound on the number 
of corners needed for corner surveillance. An officer at vertex x is considered to be 
watching vertex x and all vertices adjacent to x. There are 1 1 vertices, and six of these 
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vertices watch four vertices (themselves and three adjacent vertices). Thus three is the 
theoretical minimum. This minimum can be achieved. Details are left as an exercise. ■ 

A set C of vertices in a graph G with the property that every edge of G is incident 
to at least one vertex in C is called an edge cover. The previous example was asking 
for an edge cover of minimal size in Figure 1 .4. The reasoning in Example 4 illustrates 
the kind of systematic case-by-case analysis that is common in graph theory. 

The analysis in the previous example also illustrates a principle that is used over 
and over again in graph theory and other combinatorial settings. Namely, to show a 
graph has some property — in this case, the existence of a five-vertex edge cover — we 
assume that the property exists and deduce useful consequences of this assumption. 
The key consequence for the graph in Figure 1.4 was as follows: 

(*) if an edge ( x , y) links a 3-degree vertex x with a 2-degree vertex y then 
at most one of x and y can be used in a five-vertex edge cover 

A subsequent consequence of (*) concerning the pair x, y is that if we want to use 
vertex x (and not y) in a minimal edge cover to cover (x, y), then to cover the other 
edge at y — call it (y, z ) — vertex z would also have to be in the minimal edge cover. 

We give the mnemonic name Assumptions generate helpful Consequences — 
the AC Principle, for short — to this strategy of assuming that a graph has a desired 
property in order to deduce useful consequences, consequences we use to help us 
show that the graph indeed has this property. The AC Principle can also be used to 
show that a graph does not have some property: to do so, we deduce consequences 
under the assumption that the graph does have the property, and then show that these 
consequences lead to a contradiction. 

Example 5: Scheduling Meetings 

Consider the following scheduling problem. A state legislature has many committees 
that meet for one hour each week. One wants a schedule of committee meeting 
times that minimizes the total number of hours of meetings — but such that two 
committees with overlapping membership cannot meet at the same time. 

This situation can be modeled with a graph in which we create a vertex for 
each committee and join two vertices by an edge if they represent committees with 
overlapping membership. Suppose that the graph in Figure 1.4 now represents the 
membership overlap of 1 1 legislative committees. For example, vertex c’s edges to 
vertices b, d, and g in Figure 1 .4 indicate that committee c has overlapping members 
with committees b, d, and g. 

A set of committees can all meet at the same time if there are no edges between 
the corresponding set of vertices. A set of vertices without an edge between any two is 
called an independent set of vertices. Our scheduling problem can now be restated as 
seeking a minimum number of independent sets that collectively include all vertices. 
This problem is discussed in depth in Section 2.3. 

How many committees can meet at one time? We are asking the following graph 
question: What is the largest independent set of the graph? It is very hard in general 
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to find the largest independent set in a graph. For the graph in Figure 1.4, a little 
examination shows that there is one independent set of size 6, a, d,f, g, i, k. All other 
independent sets have five or fewer vertices. 

One goal of graph theory is to find useful relationships between seemingly unre- 
lated graph concepts that arise from different settings. Now we show that independent 
sets are closely related to edge covers. If V is the set of vertices in a graph G, then 
I will be an independent set of vertices if and only if V — I is an edge cover! Why? 
Because if there are no edges between two vertices in I , then every edge involves 
(at least) one vertex not in I — that is, a vertex in V — I . Conversely, if C is an edge 
cover so that all edges have at least one end vertex in C, then there is no edge joining 
two vertices in V — C. So V — C is an independent set. Check that in Figure 1.4, the 
vertices not in the independent set a, d,f, g, i, k form edge cover b, c, e, h,j. 

A consequence of this relationship is that if I is an independent set of largest 
possible size in a graph, then V — / will be an edge cover of smallest possible size. 
So finding a maximal independent set is equivalent to finding a minimal edge cover. ■ 

We next give an example involving directed graphs. 

Example 6: Influence Model 

Suppose psychological studies of a group of people determine which members of 
the group can influence the thinking of others in the group. We can make a graph 
with a vertex for each person and a directed edge (pi, pi) whenever person p\ 
influences p 2 . Let the graph in Figure 1 ,5a represent a set of such influences. Now 
let us ask for a minimal subset of people who can spread an idea through to the 
whole group, either directly or by influencing someone who will influence someone 
else, and so forth. In graph-theoretic terms, we want a minimal subset of vertices 
with directed paths to all other vertices (a directed path from p\ to pi- is an edge 

b b 
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sequence (p\, pf), (pf, P 3 ) . . . ( Pk-i ’, Pk ))■ Such a subset of influential vertices is 
called a vertex basis. 

To aid us, we can build a directed-patli graph for the original graph with the 
same vertex set and with a directed edge (pf pj ) if there is a directed path from p, 
to pj in the original graph. Figure 1.5b shows the directed-path graph for the graph 
in Figure 1.5a. Now our original problem can be restated as follows: Find a minimal 
subset of vertices in the new graph with edges directed to all other vertices. This is 
just a directed-graph version of the vertex-covering problem mentioned at the end 
of Example 4. Observe that any vertex in Figure 1.5b with no incoming edges must 
be in this minimal subset (since no other vertices have edges to it); vertex b is such 
a vertex. Since b has edges to a, c, and d, then e, f, and g are all that remain to be 
“influenced.” Either e,f or g “influence” these three vertices. Then b, e, or b,f or b, 
g are the desired minimal subsets of vertices. ■ 


1.1 EXERCISES 



' Summary of Exercises The first six exercises involve simple graph 
models. Exercises 7-24 present examples and extensions of the models presented in 
the examples in this section. 

1. Suppose interstate highways join the six towns A, B , C, D, E, F as follows: 1-77 
goes from B through A to E\ 1-82 goes from C through D, then through B to F; 
1-85 goes from D through A to 1-90 goes from C through E to F; and 1-9 1 goes 
from D to E. 

(a) Draw a graph of the network with vertices for towns and edges for segments 
of interstates linking neighboring towns. 

(b) What is the minimum number of edges whose removal prevents travel be- 
tween some pair of towns? 

(c) Is it possible to take a trip starting from town C that goes to every town 
without using any interstate highway for more than one edge (the trip need 
not return to C)? 

2. (a) Suppose four teams, the Aces, the Birds, the Cats, and the Dogs, play each 

other once. The Aces beat all three opponents except the Birds. The Birds 
lost to all opponents except the Aces. The Dogs beat the Cats. Represent the 
results of these games with a directed graph. 

(b) A dominance order is a listing of teams such that the ith team in the order 
beats the (i + 1) st team. Find all dominance orders for part (a). 

3. (a) A schedule is to be made with five football teams. Each team is to play two 

other teams. Explain how to make a graph model of this problem. 

(b) Show that except for interchanging names of teams, there is only one possible 
graph in part (a). 

4. Suppose there are six people — John, Mary, Rose, Steve, Ted, and Wendy — who 
pass rumors among themselves. Each day John talks with Mary and Wendy; Mary 
talks with John. Rose, and Steve; Rose talks with Mary, Steve, and Ted; Steve 
talks with Mary, Rose, Ted, and Wendy; Ted talks with Rose, Steve, and Wendy; 
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and Wendy talks with John, Steve, and Ted. Whatever people hear one day they 
pass on to others the next day. 

(a) Model this rumor-passing situation with a graph. 

(b) How many days does it take to pass a rumor from John to Steve? Who will 
tell it to Steve? 

(c) Is there any way that if two people stopped talking to each other, it would 
take three days to pass a rumor from one person to all the others? 

5. (a) Give a direction to each edge in Figure 1 .4 so that there are directed routes 

from any vertex to any other vertex. 

(b) Do part (a) so as to minimize the length of the longest directed path between 
any pair of vertices. Explain why a smaller minimum is not possible. 

6. (a) What is the length of the longest possible path (with the most vertices) in the 

graph in Figure 1.3, ignoring directions of edges? 

(b) What is the length of the longest possible circuit (with the most vertices) in 
the graph in Figure 1 .4? 

7. Find a matching, or explain why none exists for the following graphs: 

(a) (b) A 

B b B 

D ^ d D 

8. Give another reason why Figure 1.2 has no matching by considering the appro- 
priate subset of jobs (showing that they cannot all be filled). 

9. We generalize the idea of matching in Example 1 to arbitrary graphs by defining 
a matching to be a pairing off of adjacent vertices in a graph. For example, one 
possible matching in Figure 1.1 a is a-b, c-d. Which of the following graphs have 
a matching? If none exists, explain why. 

(a) Figure 1.4 (b) 


h f 

10. (a) Suppose a dictionary in a computer has a “start” from which one can branch to 

any of the 26 letters: at any letter one can go to the preceding and succeeding 
letters. Model this data structure with a graph. 

(b) Suppose additionally that one can return to “start” from letters c or k or t. 
Now what is the longest directed path between any two letters? 

11. Build the complete testing tree in Example 2 to identify one of the 26 letters of 
the alphabet. 

12. Repeat Example 2 using three-way comparisons (less than, greater than, or equal 
to) to identify one of the 26 letters. 
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13. Suppose eight current varieties of chipmunk evolved from a common ancestral 
strain through an evolutionary process in which at various stages one ancestral 
variety split into two varieties (none of the ancestral varieties survive when they 
split into two new varieties). 

(a) Explain how one might model this evolutionary process with a graph. 

(b) What is the total number of splits that must have occurred? 

14. In Example 3, find a minimal connecting set of edges containing neither (a, b) 
nor ib,c). 

15. (a) What are the other sets of two edges whose removal disconnects the graph 

in Figure 1.4 besides (a, b), (a, e) and (c, d), (d, K)1 Either produce others or 
give an argument why no others exist. 

(b) Find all sets of two vertices whose removal disconnects the remaining graph 
in Figure 1 .4. 

16. (a) For the following graph, find all sets of two vertices whose removal discon- 

nects the graph of remaining vertices. 

(b) Find all sets of two edges whose removal would disconnect the graph. 



8 f 


17. Find a minimal edge cover and a minimal set of vertices adjacent to all other 
vertices for the graph in Figure 1.2. 

18. In Figure 1.4, find all sets of three vertices that are adjacent to all the other 
vertices. Give a careful logical analysis to justify your answer. 

19. Repeat Example 4 for minimal block and corner surveillance when the network 
in Figure 1.4 is altered by adding edges (/, g), ( g,j ) and deleting (b,f). 

20. Repeat Example 4 for the edge cover and minimal corner surveillance when the 
network is formed by a regular array of north-south and east-west streets of size: 

(a) 3 streets by 3 streets (b) 4 streets by 4 streets (c) 5 streets by 5 streets 

21. (a) A queen dominates any square on a chessboard in the same row, column, or 

diagonal as the queen. How few queens can dominate all squares on an 8 by 
8 chessboard? 

(b) Repeat this problem for bishops, which dominate only diagonals. 

22. Solve the committee scheduling problem for the committee overlap graph in 
Figure 1.4. That is, what is the minimum number of independent sets needed to 
cover all vertices? 

23. (a) Find a maximum independent set in the following graphs: 

(i) Figure 1.1a (ii) Figure 1.2 
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(b) Use your result in part (a) to produce a minimal edge cover in these 
graphs. 

24. What is the largest independent set in a circuit of length 7? Of length n? 

25. (a) What is the largest independent set possible in a connected seven-vertex 

graph? Draw the graph. 

(b) What is the largest independent set possible in a seven-vertex graph (need 
not be connected)? Draw the graph. 

26. Find a vertex basis in the following directed graphs: 

(a) Figure 1.1 b (b) Figure 1.3 

(c) Figure 1.4 with edges directed by alphabetical order [e.g., edge (a, e) is 
directed from a to e ] 

27. Show that the vertex basis in a directed graph is unique if there is no sequence 
of directed edges that forms a circuit in the graph. 

28. A game for two players starts with an empty pile. Players take turns putting one, 
two, or three pennies in the pile. The winner is the player who brings the value 
of the pile up to 1 . 

(a) Make a directed graph modeling this game. 

(b) Show that the second player has a winning strategy by finding a set of four 
“good” pile values, including 16 (f, such that the second player can always 
move to one of the “good” piles (when the second player moves to one of 
the good piles, the next move of the first player must be to a non-good pile, 
and from this position the second player has a move to a good pile, etc.). 

29. The parsing of a sentence can be represented by a directed graph, with a vertex S 
(for the whole sentence) having edges to vertices Su (subject) and P (predicate), 
then Su and P having edges to the parts into which they are decomposed into 
pieces, and so on. 

Consider the abstract grammar with decomposition rules: S — > AB, S — > BA, 
A — » ABA, B — > BAS, and B — > S. For example, BAABA can be “parsed” as shown 
below. 



A 


B 




ABA 


B A 


Find a parsing graph for each of the following (or explain why no parsing 
exists): 


(a) BABABABA 


(b) BBABAABA 
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1.2 ISOMORPHISM 

In this section we investigate some of the basic structure of graphs. We are interested 
in properties that distinguish one vertex in a graph from another vertex and, more 
generally, that distinguish one graph from another graph. We motivate this discussion 
with the question: how can we tell if two graphs are really the same graph, but drawn 
differently and with different names for the vertices? For example, are the two five- 
vertex graphs in Figure 1.6 different versions of the same graph? 

A graph can be drawn on a sheet of paper in many different ways. Thus, it is usually 
possible to draw a graph in two ways that would lead a casual viewer to consider the 
drawings to be “different” graphs. This motivates the following definition. 


Two graphs G and G' are called isomorphic if there exists a one-to-one corre- 
spondence between the vertices in G and the vertices in G' such that a pair of 
vertices are adjacent in G if and only if the corresponding pair of vertices are 
adjacent in G' . 


Such a one-to-one correspondence of vertices that preserves adjacency is called 
an isomorphism. A useful way to think of isomorphic graphs is as follows: the first 
graph can be redrawn on a transparency that can be exactly superimposed over a 
drawing of the second graph. 

To be isomorphic, two graphs must have the same number of vertices and the same 
number of edges. The two graphs in Figure 1.6 pass this initial test. Both graphs have 
one vertex, e and 5, respectively, at the end of just one edge. Then any isomorphism of 
these two graphs must match e with 5. Also, the vertices at the other ends of the edge 
from e and 5 must be matched; that is, cl matches with 4. (Think of superimposing 
one graph over the other.) The remaining three vertices in each graph are mutually 
adjacent (forming a triangle) and also are all adjacent to d or 4, respectively. Thus the 
matching a with 7, b with 2, and c with 3 (or any other matching of these two subsets 
of three vertices) will preserve the required adjacencies. The correspondence a — 7, 
b — 2, c — 3, d — 4, e — 5 is then an isomorphism, and the two graphs are isomorphic. 
To visualize how they can be made to look the same, think of moving vertices 4 and 
5 in the right graph upward and to the right [past edge (7,5)], so that 7, 2, 3, 4 form a 
quadrilateral with crossing diagonals. 



Figure 1.6 t> 
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Recall that the degree deg(jt) of a vertex is the number of edges incident to the 
vertex. Degrees are preserved under isomorphism — that is, two matched vertices 
must have the same degree. Then in Figure 1.6, e has to be matched with 5 and 
d matched with 4 because they are the unique vertices of degree 1 and 4 in their 
respective graphs. Further, two isomorphic graphs must have the same number of 
vertices of a given degree. For example, if they are to be isomorphic, the two graphs 
in Figure 1 .6 must both have the same number of vertices of degree 3 — they do; both 
have three vertices of degree 3. 

A subgraph G' of a graph G is a graph formed by a subset of vertices and edges 
of G. If two graphs are isomorphic, then subgraphs formed by corresponding vertices 
and edges must be isomorphic. In Figure 1.6, removal of vertices e and 5 (and their 
incident edges) leaves two isomorphic subgraphs consisting of four mutually adjacent 
vertices. Once this subgraph isomorphism is noted, isomorphism of the whole graphs 
is easily demonstrated. 

Subgraphs can be used to test for isomorphism in the following way. If a graph 
G has a set of six vertices forming a chordless circuit of length 6 (chordless means 
there are no other edges between these six vertices except the six edges forming the 
circuit), then any graph isomorphic to G must also have a set of six vertices forming 
such a chordless 6-circuit. 

A graph with n vertices in which each vertex is adjacent to all the other vertices 
is called a complete graph on n vertices, denoted K n . A complete graph on two 
vertices, K 2 , is just an edge. Complete subgraphs are in a sense the building blocks 
of all larger graphs. For example, both graphs in Figure 1 .6 consist of a K 4 and a Ki 
joined at a common vertex. Conversely, every graph on n vertices is a subgraph of K„. 

Before examining other pairs of graphs for isomorphism, let us mention the 
practical importance of determining whether two graphs are isomorphic. Researchers 
working with organic compounds build up large dictionaries of compounds that they 
have previously analyzed. When a new compound is found, they want to know if it 
is already in the dictionary. Large dictionaries can have many compounds with the 
same molecular formula but differing in their structure as graphs (and possibly in other 
ways). Then one must test the new compound to see if its graph-theoretic structure is 
the same as the structure of one of the known compounds with the same formula (and 
the same in other ways) — that is, whether the new compound is graph-theoretically 
isomorphic to one of a set of known compounds. A similar problem arises in designing 
efficient integrated circuitry for a computer. If the design problem has already been 
solved for an isomorphic circuit (or if a piece of the new network is isomorphic to a 
previously designed circuit), then valuable savings in time and money are possible. 


Example 1: Simple Isomorphism 

Are the two graphs in Figure 1 .7 isomorphic? 

Both graphs have eight vertices and 10 edges. Let us examine the degrees of the 
different vertices. We see that b, d, f h and 3, 4, 7, 8 have degree 2, while the other 
vertices have degree 3 . Then the two graphs have the same number of vertices of degree 
2 and the same number of degree 3. The respective subgraphs of the four vertices 
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Figure 1.7 




of degree 2 (and the edges between these degree-2 vertices) in each graph must be 
isomorphic if the whole graphs are isomorphic. However, there are no edges between 
any pair of b, d, f h, while the other subgraph of degree-2 vertices has two edges: 
(4, 3) and (8, 7). So the subgraphs of degree-2 vertices are not isomorphic, and 
hence the two full graphs are not isomorphic. The reader can also check that the two 
subgraphs of degree-3 vertices in each graph are not isomorphic. ■ 


The vertices of degree 2 in the left graph in Figure 1 .7 form a subgraph of mutually 
nonadjacent vertices. Such a subgraph is called a set of isolated vertices. 

Let us review the reasoning used in Example 1. It is a contrapositive version 
of the AC Principle, Assumptions generate helpful Consequences, introduced after 
Example 4 in Section 1.1. The contrapositive statement is that if a consequence is 
false, then the assumption must be false. In this case, we assume that two graphs G 
and G' are isomorphic. A consequence of this assumption is that G 2 and G) must also 
be isomorphic, where G 2 (G)) is the subgraph of G (G') generated by its vertices of 
degree 2. For the graphs in Example 1 . the contrapositive statement is that if G 2 and G) 
are not isomorphic, then the assumption that G and G' are isomorphic must be false. 


Example 2: Isomorphism in Symmetric Graphs 

Are the two graphs in Figure 1.8 isomorphic? 

The two graphs both have seven vertices and 14 edges. Every vertex in both 
graphs has degree 4. Further, both graphs exhibit all the symmetries of a regular 
7-gon. With no distinctions possible among vertices within the same graph, our only 
option is to try to construct an isomorphism. To do this, we assume that there is an 
isomorphism and use the AC Principle to deduce properties of an isomorphism for 
these two graphs that can guide us to construct such an isomorphism. If at some point 
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in our construction a contradiction arises, then we know our assumption was false 
and there is no isomorphism. 

Start with vertex a in the left graph. By rotational symmetry, we can match a 
to any vertex in the right graph (that is, if the two graphs are isomorphic, there will 
exist an isomorphism with a matched to any vertex in the right graph). Let us use the 
match a — 1 . 

The set of neighbors of a (vertices adjacent to a) must be matched with the set 
of neighbors of 1. Let us look at the subgraphs formed by these neighbors of a and 1. 
See Figure 1.9. Both subgraphs are paths: one is/to g to b to c, and the other is 7 to 4 
to 5 to 2. The isomorphism must make these path subgraphs isomorphic. Thus, /and 
c must be matched with 7 and 2 (matching ends of the two paths). By the left-right 
symmetry of the graphs, it makes no difference which way/and c are matched — say 
/- 7 and c - 2. Then to complete the isomorphism of neighbors of a and 1, we must 
match g with 4 and b with 5. Now there remain only two unmatched vertices in each 
graph: d, e and i, 6. Vertex g is adjacent to e but not d, and its matched vertex 4 is 
adjacent to 3 but not to 6. Thus we must match e with 3 and d with 6. 

In sum, allowing for symmetries to match a with 1 and / with 7, we conclude 
that if the graphs are isomorphic, one isomorphism must be a — 7, b — 5, c — 2, d — 6, 
e — 3, f — 7, g — 4. Checking edges, we see that the graphs are indeed isomorphic 
with this matching (if this matching were found not to be an isomorphism, then the 
two graphs would not be isomorphic, since the matches we made were all forced 
except for the symmetries involving the matches of a and/). ■ 


Given a graph G — ( V , E ), its complement is a graph G — ( V , E) with the same 
set of vertices but now with edges between exactly those pairs of vertices not linked 
in G. The union of the edges in G and G forms a complete graph. Two graphs G i and 
G 2 will be isomorphic if and only if G\ and G 2 are isomorphic. The isomorphism 
problem in Example 2 is easy to answer using complements. Figure 1.10 shows the 



18 Chapter 1 Elements of Graph Theory 



Figure 1.11 


/ 

< 

7 


' ' 4 


' 

6 


L 


/ 


complements of the two graphs in Figure 1.8. Clearly, both these complementary 
graphs are just a (twisted) circuit of length 7 and hence are isomorphic. 

In general, if a graph has more pairs of vertices joined by edges than pairs not 
joined by edges, then its complement will have fewer edges and thus will probably 
be simpler to analyze. 


Example 3: Isomorphism of Directed Graphs 

Are the two directed graphs in Figure 1.11 isomorphic? 

Each graph has eight vertices and 12 edges, and each vertex has degree 3. If we 
break the degree of a vertex into two parts, the in-degree (number of edges pointed 
in toward the vertex) and out-degree (number of edges pointed out), we see that 
each graph has four vertices of in-degree 2 and out-degree 1, and each graph has four 
vertices of in-degree 1 and out-degree 2. We could try to build an isomorphism as in 
the previous example by starting with a match (by a symmetry argument) between a 
and 1 and then matching their neighbors (with edge directions also matched), and so 
forth. 

However, there is a basic difference in the directed path structure of the two 
graphs. We will exploit this difference to prove nonisomorphism. In the left graph we 
can draw a directed path from any given vertex to any other vertex by going clockwise 
around the circle of vertices: the outer edges form a directed circuit through all the 
vertices in the left graph. But in the right graph, all edges between the vertex subsets 
V\ = {7, 2, 3, 4} and Vi = {5, 6, 7, 8} are directed from V) to Vi, and so there can be 
no directed paths from any vertex in Vi to any vertex in V\ (nor is there a directed 
circuit through all the vertices). Thus, the two graphs are not isomorphic. ■ 



1.2 EXERCISES 


1. List all nonisomorphic undirected graphs with four vertices. 

2. List all nonisomorphic directed graphs with three vertices. 

3. Draw two nonisomorphic graphs with 

(a) Six vertices and 10 edges 

(b) Nine vertices and 13 edges 
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4. If directions are ignored, are the two graphs in Figure 1.11 isomorphic? 

5. Which of the following pairs of graphs are isomorphic? Explain carefully. 
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6. Which of the following pairs of graphs are isomorphic? Explain carefully, 
(a) (b) 



Ky\ 
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7 . Which pairs of graphs in this set are isomorphic? 



8. Which pairs of graphs in this set are isomorphic? 



9 . Suppose each edge in the graphs in Figure 1.8 is directed from the smaller 
(numerically or alphabetically) end vertex to the larger end vertex. Are the two 
resulting directed graphs isomorphic? 

10 . Are the following pairs of directed graphs isomorphic? 



11 . Show that all 5-vertex graphs with each vertex of degree 2 are isomorphic. 

12 . Are there any 6-vertex graphs with three edges incident to each vertex that are 
not isomorphic to one of the graphs in Exercise 7? 

13 . What are the sizes of the largest complete subgraphs in the two graphs in Exercise 
6(g)? 
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14 . Build 6-vertex graphs with the following degrees of vertices, if possible. If not 
possible, explain why not. 

(a) Three vertices of degree 3 and three vertices of degree 1 

(b) Vertices of degrees 1, 2, 2, 3, 4, 5 

(c) Vertices of degrees 2, 2, 4, 4, 4, 4 



1.3 EDGE COUNTING 


There is very little in the way of general assertions that can be made about all graphs. 
There is one useful general theorem, a formula for counting edges. 

Theorem 1 

In any graph, the sum of the degrees of all vertices is equal to twice the number of 
edges. 


Proof 


Summing the degrees of all vertices counts all instances of some edge being incident 
at some vertex. But each edge is incident with two vertices, and so the total number 
of such edge-vertex incidences is simply twice the number of edges. The theorem is 
now proved. ♦ 

As an illustration of Theorem 1, consider the graph in Figure 1.12 with six 
vertices, three of degree 4, two of degree 3, and one of degree 2. The sum of the 
degrees is 4 + 4 + 4+ 3 + 3 + 2 = 20. This sum must equal twice the number of 
edges. The reader can check that the number of edges in this graph is 10. 

For the sum of degrees to be an even integer, there must be an even number of 
odd integers in the sum. Thus we obtain the following 

Corollary 

In any graph, the number of vertices of odd degree is even. 

Let us now look at uses of this theorem and corollary. 


b 


Figure 1.12 f 
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Example 1: Use of Theorem 1 

Suppose we want to construct a graph with 20 edges and have every vertex of degree 4. 
How many vertices must the graph have? 

Let v denote the number of vertices. The sum of the degrees of the vertices will be 
4v, and by the theorem this sum must be twice the number of edges: 4v = 2 x 20 = 40. 
Hence v = 10. ■ 

Example 2: Edges in a Complete Graph 

How many edges are there in K n , a complete graph on n vertices? 

Recall that K n has an edge between all possible pairs of vertices. At any given 
vertex, there will be edges going to each of the n— 1 other vertices in K „ , and so 
each vertex has degree n — 1 . The sum of the degrees of all n vertices in K„ will be 
n(n — 1). Since this sum equals twice the number of edges, the number of edges is 
n(n — l)/2. ■ 

Example 3: Impossible Graph 

Is it possible to have a group of seven people such that each person knows exactly 
three other people in the group? 

If we model this problem using a graph with a vertex for each person and an edge 
between each pair of people who know each other, then we would have a graph with 
seven vertices all of degree 3. But this is impossible by the Corollary — the number of 
vertices of odd degree must be even — and so no such set of seven people can exist. ■ 


Recall that a graph G is connected if every pair of vertices in G is joined by a 
path in G. If G is not connected, its vertices can be partitioned into connected pieces, 
called components. Formally, a component H is a connected subgraph of G such that 
there is no path between any vertex in H and any vertex of G not in H. The component 
of G containing a particular vertex x consists of x and all vertices that may be reached 
from x by a path in G. Because each component of G is a graph in its own right, 
this section’s Corollary applies to each component as well as to G. We next present 
a puzzle that seems to have no relation to graphs. 

Example 4: Mountain Climbers Puzzle 

Two people start at locations A and Z at the same elevation on opposite sides of a 
mountain range whose summit is labeled M. See Figure 1.13a. We pose the following 
puzzle: is it possible for the people to move along the range in Figure 1.13a to meet 
at M in a fashion so that they are always at the same altitude every moment? We shall 
show this is possible for any mountain range like Figure 1.13a. The one assumption 
we make is that there is no point lower than A (or Z) and no point higher than M. 

We make a range graph whose vertices are pairs of points (Pi, Pr) at the same 
altitude with Pi on the left side of the summit and Pr on the right side, such that one 
of the two points is a local peak or valley (the other point might also be a peak or 
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Figure 1.13 


valley). The vertices for the range in Figure 1.13a are shown in the graph in Figure 
1.13b. We make an edge joining vertices {Pi, P R ) and (P' L , P' R ) if the two people can 
move constantly in the same direction (both going up or both going down) from point 
Pi to point P’ L and from P R to P' R , respectively. Our question now: is there a path in 
the range graph from the starting vertex (A, Z ) to the summit vertex (M, M)? For the 
graph in Figure 1.13b, the answer is obviously yes. 

We claim that vertices (A, Z) and ( M , M) in any range graph have degree 1, 
whereas every other vertex in the range graph has degree 2 or 4. (A, Z) has degree 1 
because when both people start climbing up the range from their respective sides, 
they have no choice initially but to climb upward until one arrives at a peak. In Figure 
1.13a, the first peak encountered is C on the left, and so the one edge from (A, Z) 
goes to (C, X). A similar argument applies at ( M , M). Next consider a vertex {Pi, P R ) 
where one point is a peak and the other point is neither peak nor valley, such as 
{E, W). From the peak we can go down in either direction: at W, we can go down 
toward Z or toward U. In either direction, the people go until one (or both) reaches 
a valley. At {E, W), the two edges go to (I), Y ) and (I), U). Thus such a vertex has 
degree 2. A similar argument applies if one point (but not both) is a valley. It is left 
as an exercise for the reader to show that if a vertex {Pi, If) consists of two peaks 
or two valleys, such as ( D , U), it will have degree 4. (A vertex consisting of a valley 
and a peak will have degree 0 — why?) 

Suppose there were no path from (A, Z) to {M, M ) in the range graph. Thus, 
these two vertices are in different components of the range graph. We use the fact that 
starting vertex (A, Z) and summit vertex ( M,M ) are the only vertices of odd degree to 
obtain a contradiction. The component of the range graph consisting of (A, Z) and all 
the vertices that can be reached from (A, Z) would form a graph with just one vertex 
of odd degree, namely, (A, Z). This contradicts the Corollary, and so any range graph 
must have a path from (A, Z) to (M, M). m 

Many interesting properties in graph theory are dependent on certain sets of edges 
having even size. Euler cycles, discussed in Section 2.1, arise when all vertices have 
even degree. 

In Example 2 of Section 1.1, we considered a matching problem involving the 
graph shown in Figure 1.14. The vertices on the left represented people and the vertices 
on the right represented jobs. An edge links a left vertex to a right vertex to indicate 


1.3 Edge Counting 27 



that a certain person can perform a certain job. There can never be an edge between 
two vertices on the left or between two vertices on the right. Such a graph is called a 
bipartite graph. Formally, a graph G is bipartite if its vertices can be partitioned into 
two sets Vj and V 2 and every edge joins a vertex in Vj with a vertex in V 2 . 

Bipartite graphs can be characterized by all circuits in such graphs having even 
length (if there are no circuits, the graph is also bipartite), where the length of a circuit 
or path is the number of edges in it. 


Theorem 2 

A graph G is bipartite if and only if every circuit in G has even length. 


Proof 

Note that it is sufficient to prove this theorem for connected bipartite graphs. We 
claim that if the theorem is true for each connected component of a disconnected 
bipartite graph G, then it is true for G (components were formally defined just above 
Example 4). This claim follows from G’s being bipartite if and only if each of its 
components is bipartite, and any circuit in G’s having even length if and only if any 
circuit in each of its components has even length. 

First we show that if G is bipartite, then any circuit has an even length. If G is 
bipartite so that it can be drawn with all edges connecting a left vertex with a right 
vertex, then any circuit X 1 -X 2 -X 3 ■ — x n -x\ has alternately a left vertex, then a right 
vertex, then a left vertex, and so on, assuming the first vertex x\ is on the left. Odd- 
subscripted vertices are on the left, and even-subscripted vertices on the right. See 
Figure 1.15a. Since x„ is adjacent to x\, x n must be on the right, so its subscript is 
even. That is, there are an even number of vertices in the circuit. Any circuit has the 
same number of edges as vertices, and thus this circuit has even length. 



Figure 1.15 
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Suppose next that any circuit in G (there may be no circuits) has even length. We 
show how to construct a bipartite arrangement of G. We use the AC Principle and 
assume that a bipartition exists, and use properties of that bipartition to construct it. 
Take any given vertex, call it a, and put it on the left. Put all vertices adjacent to a on 
the right. Next put all vertices that are two edges away from a , that is, at the end of 
some path of length 2 from a, on the left. In general, if there is a path of odd length 
between a and a vertex x, put x on the right. If there is a path of even length between 
a and x, put x on the left. 

There cannot be distinct paths P and P' between a and x of odd and of even 
lengths, respectively, since taking P from a to x and then returning to a on P' yields 
an odd-length circuit. This is impossible, since all circuits have even length. (If P and 
P' have a vertex q in common besides a and x, then a further argument is needed to 
show that there is a circuit of odd length. See Exercise 15 for details.) 

Similarly, we argue that there cannot be an edge between two vertices, say, b and c, 
both on the left. There must exist even-length paths Q, Q' joining a with b and c, 
respectively (since b and c are on the left). See Figure 1 . 1 5b, in which Q is dashed 
and Q' is dotted. Observe that Q' followed by the edge (c, b ) yields an odd-length 
path from a to b. This is impossible, since we just proved that there cannot be both 
an even-length path ( Q ) and an odd-length path < Q' plus (a. b )) from a to any other 
vertex in G. By similar reasoning, two vertices on the right cannot be adjacent. Thus, 
we have a bipartite arrangement of G. ♦ 

Example 5: Testing for a Bipartite Graph 

Is the graph in Figure 1 . 16« bipartite? 

Pick any vertex, say a, and put it on the left. We follow the approach in the second 
half of the proof of Theorem 2. Put vertices joined to a by an even-length path on 
the left and vertices joined to a by an odd-length path on the right. If all the circuits 
in this graph are even-length, then the reasoning in the above proof guarantees that 
our placement of vertices will yield a bipartite arrangement. If we end up with two 
vertices on the left (or on the right) being adjacent, then the graph cannot be bipartite. 
In this case, the construction succeeds, as shown in Figure 1.16 b. m 



Figure 1.16 


(b) 
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1.3 EXERCISES 


1. How many vertices will the following graphs have if they contain: 

(a) 12 edges and all vertices of degree 3. 


(b) 21 edges, three vertices of degree 4, and the other vertices of degree 3. 

(c) 24 edges and all vertices of the same degree. 


2. For each of the following questions, describe a graph model and then answer the 

question. 

(a) Must the number of people at a party who do not know an odd number of 
other people be even? 

(b) Must the number of people ever born who had (have) an odd number of 
brothers and sisters be even? 

(c) Must the number of families in Alaska with an odd number of children be 
even? 

(d) For each vertex x in the following graph, let.v(x) denote the number of vertices 
(including x ) adjacent to at least one of x’s neighbors. Must the number of 
vertices with s(x ) odd be even? Is this true in general? 



3. What is the largest possible number of vertices in a graph with 19 edges and all 
vertices of degree at least 3? 

4. Is any subgraph of a bipartite always bipartite? Prove, or give a counterexample. 

5. What constraint must be placed on a bipartite graph G to guarantee that G’s 
complement will also be bipartite? 

6. If a graph G has n vertices, all of which but one have odd degree, how many 
vertices of odd degree are there in G, the complement of G? 

7. Show that a complete graph with m edges has (1 + 8m)/2 vertices. 

8. Let G be an /z- vertex graph that is isomorphic to its complement G. How many 
edges does G have? (Hint: Use Exercise 5.) 

9. Suppose all vertices of a graph G have degree p, where p is an odd number. Show 
that the number of edges in G is a multiple of p. 

10. There used to be 26 football teams in the National Football League (NFL) with 13 
teams in each of two conferences (each conference was divided into divisions, 
but that is irrelevant here). An NFL guideline said that each team’s 14-game 
schedule should include exactly 1 1 games against teams in its own conference 
and three games against teams in the other conference. By considering the right 
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part of a graph model of this scheduling problem, show that this guideline could 
not be satisfied! 

11. Prove a directed version of Theorem 1 : The sum of the in-degrees of vertices in 
a directed graph equals the sum of the out-degrees of vertices, and further, each 
sum equals the number of edges. 

12 . Build the range graph for each of the following mountain ranges and use the 
graph to find a solution to the problem in Example 4. 




13 . Prove in a range graph that if a vertex {Pi, Pr) consists of two peaks or two 
valleys, it will have degree 4. 

14 . Prove in a range graph that if a vertex {Pi, Pr) consists of a valley and a peak, 
it will have degree 0. 

15 . Determine whether the following graphs are bipartite. If so, give the partition 
into left and right vertices as in Figure 1.16ft. 

(a) Figure 1.4 (b) Figure 1.7 (left graph) (c) Figure 1.12 

16 . Determine whether the following graphs are bipartite. If so, give the partition 
into left and right vertices as in Figure 1.16ft. 



a 



17 . Suppose x and y are the only two vertices of odd degree in graph G, and x and y 
are not adjacent to each other. Show that G is connected if and only if the graph 
obtained from G by adding edge (x, y) is connected. 

18 . In the second part of the proof of Theorem 2, one can encounter the situation 
in which there exist paths P and P' between a and x, P of odd length and P' of 
even length, and these two paths have one or more vertices in common. One must 
show that a subset of the edges on these two paths forms an odd-length circuit. 
Let q be the first vertex on P, starting from a, that also lies on P' . Show that either 
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the circuit from a along P to q and then back on P' to a, or the edge sequence 
from q along P to x and then back on P' to q, has odd length. In the latter case, if 
the edge sequence is not a circuit, then it has a vertex q' on both P and P'. Repeat 
the same reasoning considering the circuit on P from q to q’ and then back on 
P' to q or the edge sequence from q’ along P to x and back along P' to q’ . 



1.4 PLANAR GRAPHS 

The most natural examples of graphs are street maps and telephone networks. The 
graphs that arise from such physical networks usually have the property that they can 
be depicted on a piece of paper without edges crossing (different edges meet only at 
vertices). We say that a graph is planar if it can be drawn on a plane without edges 
crossing. We use the term plane graph to refer to a planar depiction of a planar graph. 
The two graphs in Figure 1.17 are both planar. The graph in Figure 1.17b is a plane 
graph. The graph in Figure 1 . 17 a is planar, since it can be redrawn in the form of the 
graph in Figure 1.17b. 

Our principal focus in this section is determining whether a graph is planar. We 
take two approaches, both based on the AC Principle. The first approach involves a 
systematic method for trying to draw a graph edge-by-edge with no crossing edges, 
in the same spirit as when we tried to determine if two graphs are isomorphic. The 
second approach develops some theory with a goal of finding useful properties of 
planar graphs. If a graph does not satisfy one or more of these properties, then we 
know that it cannot be planar. 

Remember that if a graph G has been drawn with edges crossing, this does not 
mean the graph is nonplanar. There may be another way to draw the graph without 
edges crossing, as illustrated by the graph in Figure 1.17a, which can be redrawn to 
be the plane graph in Figure 1.17b. 

Probably the most important need today for testing whether a graph is planar 
arises in designing electronic circuits. Complex integrated circuits are nonplanar and 
require several layers of (planar) circuit connections in their wiring. But the number 
of layers is limited and so a major problem in integrated circuit design is decomposing 
a large circuit into a minimal number of subcircuits that are known to be planar. 


a 



(«) (b) 


Figure 1.17 
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A more mundane, but still important, use of planarity testing arises in checking 
data-entry errors in planar networks. When a large planar graph such as a city’s street 
network is entered on data terminals for computerized analysis, it is a common error- 
checking technique to test first whether the graph as typed in is indeed planar (most 
data-entry errors would make the graph nonplanar). 

Planar graphs were first studied extensively by mathematicians over 100 years 
ago in connection with a map-coloring problem. 

Example 1: Map Coloring 

One of the most famous problems in mathematics concerns map coloring. The 
question is how many colors are needed to color countries on some map so that any 
pair of countries with a common border are given different colors. A map of countries 
is a planar graph with edges as borders and vertices where borders meet. See 
Figure 1 . 1 8a . However, a closely related planar graph called a dual graph of the map 
graph is more useful. The dual graph is obtained by making a vertex for each country 
and an edge between vertices corresponding to two countries with a common border. 
See Figure 1.1 8ft. Normally, a vertex is also included for the unbounded region 
surrounding the map. 

The question in the dual graph now is how many colors are needed to “color” the 
vertices such that adjacent vertices have different colors. In Figure 1.18ft, vertices A, B, 
C, D form a complete subgraph on four vertices and so each requires a different color, 
four colors in all. With four colors, we can also properly color the remaining vertices. 

One of the most famous unsolved problems in all of mathematics during the last 
century was the conjecture that all planar graphs can be properly colored with only 
four colors. In trying to resolve this conjecture, mathematicians developed a large 
theory about planar graphs. In 1976 the four-color conjecture was proven true by 
Appel and Haken using an immense computer-generated, case-by-case, exhaustive 
analysis (there were 1955 classes of graphical configurations to be considered, each 
involving numerous subcases). We will take a closer look at graph coloring in the 
next chapter. ■ 

We now use the AC Principle to try to find a planar drawing of a graph. We 
assume it to be planar. As with isomorphism between two graphs, we want to be 




Figure 1.18 
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able to conclude that a graph is not planar if our construction fails. We shall call 
our approach the circle-chord method. It starts by finding a circuit that contains all 
the vertices of our graph (though such circuits do not exist for all graphs, they are 
common in the types of graphs we will be considering in this section). We draw this 
circuit as a large circle. The remaining noncircuit edges, which we will call chords, 
must be drawn either inside the circle or outside the circle in a planar drawing. 

We choose a first chord and draw it, say, outside the circle. If properly chosen, 
this chord will force certain other chords to be drawn inside the circle (if also placed 
outside the circle, they would have to cross the first chord). These inside chords will 
force still other chords to be drawn outside, and so on. After the first chord is drawn, 
the choice of placing subsequent chords inside or outside is forced. Thus, if we reach 
a point where a new chord will have to cross some previous chord, whether the new 
chord is drawn inside or outside, we can claim that the graph must be nonplanar. That 
is, our construction based on the assumption that the graph had a planar depiction led 
to a contradiction. If all the chords can be added without crossing other chords, then 
the graph is planar. 

A critical decision is whether the first chord drawn should go inside or outside the 
circle. We claim that it makes no difference, because of the following inside-outside 
symmetry of a circle. Consider two maps of the earth, the first with the North Pole 
at the center of the map, the second with the South Pole at the center. Suppose each 
map has a path drawn in the Northern Hemisphere linking two cities on the equator. 
In the first map (with the Northern Hemisphere inside the equator) the path is inside 
the equator’s circle, whereas in the second map the path is outside the equator. Think 
of the circle formed by the circuit in the circle-chord method as the equator and the 
first chord as the path between two cities. Whether the chord is inside or outside the 
circle is just a matter of which “map” of the earth one uses. 

We note that very efficient algorithms exist to test whether a graph is planar, 
whereas there is no efficient algorithm known to test whether two graphs are isomor- 
phic. The planarity testing algorithms are fairly complicated and beyond the scope of 
this text. 


Example 2: Circle-Chord Method 

Use the circle-chord method to determine whether the graph in Figure 1.19a is planar. 
Let us look for a circuit with all eight vertices. One possibility is a-f-c-h-d-g-b-e-a. 


abed 




Figure 1.19 


(a) 


(b) 
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Now try to add the other four edges (a, h), (b, /), ( c , g), (d, e). By inside-outside 
symmetry, we can start by drawing (a, h) outside. See Figure 1.19 b. Then (b, f) and 
(c, g ) must go inside. Then (d, e) must go outside. So the graph is planar, as shown in 
Figure 1.19 b. ■ 


Example 3: Showing K 3 , 3 Is Nonplanar 

Show that ^3,3, the graph in Figure 1 .20a, is nonplanar. The notation K 33 indicates 
that this graph is a complete bipartite graph consisting of two sets of three vertices 
with each vertex in one set adjacent to all vertices in the other set. Applying the 
circle-chord method, we form a circuit containing all six vertices in /C3 3, and then 
try to add the remaining edges (not in the circuit) as inside and outside chords. 

There are several choices for a 6-vertex circuit. Suppose we use the circuit 1-4- 
2-5-3-6-1 and draw it in a circle as shown in Figure 1 .20 b. Next the edges (7,5), (2, 6), 
and (5, 4) must be added. First draw chord (7, 5). By the inside-outside symmetry of 
a circle discussed above, we can assume that (7, 5) is drawn inside the circuit, as in 
Figure 1.207>. Then (2, 6) must be drawn outside the circuit to avoid crossing chord 
(7, 5). Finally, we must draw (5, 4): if drawn outside the circuit, (3, 4) would have to 
cross chord (2, 6); if drawn inside the circuit, (3, 4) would have to cross chord (7, 5). 
Thus 7^3 3 cannot be drawn in a planar depiction. Hence K 33 is nonplanar. ■ 

Using a mixture of theory and careful, case-by-case analysis, it is possible to 
prove that any nonplanar graph always contains a ^3 3 or a K$ (the complete graph 
on five vertices shown in Figure 1.21a) as a subgraph or a slight modification of these 
two graphs. It is left as an exercise to show that the circle-chord method (used in 
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Example 3 ) can be used to show that K$ is nonplanar. Thus, these two graphs are the 
“reason” that a graph cannot be drawn in a planar fashion. 

We have to allow a slight variation in K 3,3 and K$ in nonplanarity analysis. Figure 
1.21 b shows a A'3 3, graph that has been subdivided by adding vertices in the middle 
of some of its edges. The resulting graph is no longer a /I3 3 and does not contain 
W3.3 as a subgraph, yet it is still nonplanar (repeatedly adding a vertex in the middle 
of an edge cannot make a nonplanar graph planar). We say that a subgraph is a K 3 3 
configuration if it can be obtained from a ^33 by adding vertices in the middle of 
some edges. A K$ configuration is defined similarly. The following planar graph 
characterization theorem was first proved by the Polish mathematician Kuratowski. 

Theorem 1 (Kuratowski, 1930) 

A graph is planar if and only if it does not contain a subgraph that is a K$ or X33 
configuration. 

If the circle-chord method shows that a graph is nonplanar, then by Theorem 1 
this graph has a subgraph that is a K$ or K3 3 configuration. Finding such a configu- 
ration can sometimes be tricky. However, the following observation is helpful: Most 
small nonplanar graphs contain a K33 configuration. All but one of the nonplanar 
graphs in the exercises have /O3 configurations. Note also that the depiction of a 
W33 in Figure 1.20 h as a 6-vertex circle with three chords joining pairs of opposite 
vertices is the way that a ^33 configuration normally arises in a nonplanar graph. 


Example 4: Finding a K 3,3 

Use the circle-chord method to determine whether the graph in Figure 1 . 22 a is planar. 
If it is nonplanar, find a subgraph that is a W33 configuration. 

First we seek a circuit that visits all vertices. Many such circuits exist. Choose 
the circuit a-b-c-d-e-f-g-h-a, shown in Figure 1 . 22 /?. Say we pick (a, d) as our first 
chord to draw. By inside-outside symmetry, it makes no difference whether we draw 
(a, d) inside or outside the circuit. Put it inside. Then (b, e) must go outside to avoid 
intersecting (a, d). Next look for another chord reaching across the circle. Observe 





Figure 1.22 
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that chord (c, g) cannot be drawn inside without crossing (a, d), nor drawn outside 
without crossing ( b , e). So the graph is nonplanar. 

Next we look for a A '33 configuration. We can simplify the problem by restricting 
our attention to the nonplanar subgraph in Figure 1 .22/;, whose crossing edges proved 
that the full graph had to be nonplanar. A A '33 configuration has six vertices of degree 
3 (corresponding to the vertices of a A' 33 ) plus some number of vertices of degree 2 
that subdivide the edges of a K 33 . The way to find a A^ 3 configuration in a subgraph 
is to eliminate edge subdivisions in the graph (remove each vertex of degree 2 and 
combine the two edges at each such vertex into a single edge). Figure 1.22c shows 
the subgraph in Figure 1.22 b with subdivisions removed. The graph in Figure 1.22c 
looks just like the depiction of a AT ? ? in Figure 1.20 h. Thus, the subgraph in Figure 
1 . 22 b was a K 3 3 configuration. ■ 

Finding a A '33 configuration in the graph in Figure 1.22a (without using the 
subgraph in Figure 1.22 b) would be difficult. The challenging problem in finding 
a A '33 configuration in a general nonplanar graph is the following. Let z be some 
vertex of degree 3 in the original graph and suppose that just two of the z’s edges, say 
(z, r) and (z, q), are part of a A '3 3 configuration. Then z corresponds to a subdivision 
vertex in this K$ 3 configuration, and these two edges of z need to be fused into a 
single edge (r, q ) to find the underlying A '3 3 . That is, z disappears and a new edge 
(r, q) is created. Using the subgraph produced by the circle-chord method makes it 
much easier to identify vertices of degree 2 in a A '3 3 configuration whose two edges 
should be fused together. 

There are many different plane graph depictions that can be drawn for a planar 
graph. For example, we can redraw the plane graph in Figure 1.23a by making the 
region bounded by the triangle (d, e, f) very large and bringing vertex a to the right 
side, as in Figure 1.23£>. Now flip the part of the graph above and to the right of the 
triangle inside the triangle, obtaining the plane graph in Figure 1.23c. The triangle 
(d, e, /) has become the outside boundary of the whole graph. The boundary of any 
region can be converted to the outside boundary of the whole graph by a similar 
process. 

Despite this variability in plane graph depictions of a planar graph, one important 
property of the plane depictions does not change. The number of regions is always 
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the same. For simplicity, assume that G is a connected planar graph. (Recall that 
connected means having paths between every pair of vertices.) If v and e denote the 
number of vertices and edges, respectively, in G, then a plane graph depiction of G will 
always have a number of regions r given by the formula, r = e — v + 2. Remember 
that the unbounded region outside the graph is counted as a region. This remarkable 
formula for r was discovered by Euler 360 years ago. 

Theorem 2 Euler’s Formula (1752) 

If G is a connected planar graph, then any plane graph depiction of G has r = e — v + 2 
regions. 

Proof 

Let us draw a plane graph depiction of G edge by edge. We choose successive edges 
so that at every stage we have a connected subgraph. Let G n denote the connected 
plane graph obtained after n edges have been added, and let v„, e„, and r„ denote 
the number of vertices, edges, and regions in G„, respectively. Initially we have G\, 
which consists of one edge, its two end vertices, and the one (unbounded) region. Then 
d = 1, Vi = 2, ri = 1, and so Euler’s formula is valid for Gi, since ri = ei — Vi + 2: 
1 = 1 — 2 + 2. We obtain G 2 from G 1 by adding an edge at one of the vertices in G\ . 
In general, G„ is obtained from G„_i by adding an nth edge at one of the vertices of 
G„_i . The new edge might link two vertices already in G„_i . If it does not, the other 
end vertex of the nth edge is a new vertex that must be added to G„ . 

We will now use the method of induction (see Appendix A. 2) to complete the 
proof. We have shown that the theorem is true for G 1 . Next we assume that it is true 
for G„_ 1 for any n > 1, and prove that it is true for G„ . Let (x, y) be the nth edge that 
is added to G ;! _ 1 to get G„ . There are two cases to consider. 

In the first case, x and y are both in G„_ 1 . Then they are on the boundary of a 
common region K of G„_ 1 , possibly the unbounded region [if x and y were not on a 
common region, edge (x, y ) could not be drawn in a planar fashion, as required]. See 
Figure 1.24a. Edge (x, y) splits K into two regions. Then r„ = r„_i + 1, e„ = e„_i + 1, 
v„ = v„ _ 1 . So each side of Euler’s formula grows by 1. Hence, if the formula was 
true for G„_ 1 , it will also be true for G„. 

In the second case, one of the vertices x, y is not in G„_ 1 — say it is x. See Figure 
1 .24 b. Then adding (x, y) implies that x is also added, but no new regions are formed 
(i.e., no existing regions are split). Thus r„ = r„_i, e„ = e„_i + 1, v„ = v„_i + 1, and 
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the value on each side of Euler’s formula is unchanged. The validity of Euler’s formula 
for G „- 1 implies its validity for G„. 

So each increase in r is balanced in Euler’s formula by an increase in e or v. By 
induction, the formula is true for all G „ ’s and hence true for the full graph G. ♦ 


Example 5: Using Euler’s Formula 

How many regions would there be in a plane graph with 10 vertices each of degree 3? 

By Theorem 1 in Section 1.3, the sum of the degrees, 10 x 3, equals 2e, and so 
e = 15. By Euler’s formula, the number of regions r is 

r = e-v + 2= 15- 10 + 2 = 7 ■ 

Theorem 2 has the following corollary that can often be used to show quickly 
that a graph is nonplanar. 


Corollary 

If G is a connected planar graph with e > 1 , then e < 3v — 6. 


Proof 

Let us define the degree of a region analogously to the degree of a vertex to be the 
number of edges incident to a region — that is, the number of edges on its boundary. 
If an edge occurs twice along a boundary, as does (x, y ) in region K in Figure 1.25a, 
the edge is counted twice in region 1C s degree; for example, region K has degree 10 
and region L has degree 3 in Figure 1.25a. Observe that each region in a plane graph 
must have degree >3, for a region of degree 2 would be bounded by two edges joining 
the same pair of vertices and a region of degree 1 would be bounded by a loop edge 
(see Figure 1.25b), but parallel edges and loops are not allowed in graphs. 

Since each region in a plane graph has degree >3, the sum of the degrees of 
all regions will be at least 3r. But this sum of degrees of all regions must equal 2e, 
since this sum counts each edge twice, that is, each of an edge’s two sides is part of 
some boundary (this is the same type of argument as used to show that the sum of the 
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vertices’ degrees equals 2e). Thus, 2e = (sum of regions’ degrees) > 3r, or |e>r. 
Combining this inequality with Euler’s formula (Theorem 2), we have 


§e>r = e — v + 2 


or 0 > |e — v + 2 


Solving for e, we obtain e < 3 v — 6 . ♦ 

Example 6: Showing That K$ Is Nonplanar 

Use the Corollary to prove that K 5 , the complete graph on 5 vertices, is nonplanar. 

The graph K 5 has v = 5 and e = 10 (see Example 2 in Section 1.3 for how to 
find the number of edges in a complete graph). Then 3v — 6 = 3x5 — 6 = 9. But 
the Corollary says that e < 3v — 6 must be true in a connected planar graph, and so 
assuming K$ is a planar graph leads to a contradiction — namely, that the Corollary is 
not true. So K$ cannot be planar. ■ 

The Corollary should not be misinterpreted to mean that if e < 3v — 6 , then a 
connected graph is planar. Many nonplanar graphs also satisfy this inequality. For 
example, A " 3 3 with v = 6 and e = 9 satisfies it. 

Our two theorems and corollary have laid the foundation for a mathematical 
theory of planar graphs. In the process, we have acquired a practical aid for showing 
that a graph is nonplanar. One way to extend this theory is to make the inequality in 
the Corollary “stronger” — that is, to get a smaller upper bound on e. Recall that the 
key step in proving the Corollary was the observation that every region has degree at 
least 3. This led to the inequality 2e > 3r. Suppose that a certain connected graph G 
(with at least two edges) is known to be bipartite. By Theorem 2 in Section 1.3, all the 
circuits in a bipartite graph have even length. Then no region in this graph can have 
degree 3 (since this would imply a boundary circuit of length 3). Then every region 
in a bipartite planar graph must have degree >4. Summing the degrees of all regions, 
we now obtain the inequality 2e = (sum of degrees of regions) > 4r. Reworking the 
Corollary with the inequality 2e > 4r, we have |e> r = e — v + 2, and hence 


e < 2v — 4 


Every connected planar graph that is bipartite must satisfy this inequality. Con- 
sider our “favorite” bipartite graph K 33 . X 33 has v = 6 and e = 9 and, as noted 
above, satisfies the corollary inequality e < 3v — 6 . But since it is bipartite, X 3 3 
would also have to satisfy the new inequality e < 2v — 4 if it were planar. It does not: 
9 2 x 6 — 4. 


1.4 EXERCISES 



' Summary of Exercises The first six exercises involve determining 
whether various graphs are planar and drawing planar graphs in different ways. Exer- 
cise 10 involves duality. Exercises 15-26 build on Euler’s formula and the corollary 
e < 3v — 6 . The other exercises introduce new concepts. 


40 


Chapter 1 Elemen ts of Graph Theory 


1. Draw a dual graph of the planar graph (include a vertex for the unbounded region 
outside the graph) in 

(a) Figure 1.1 lb 

(b) Figure 1.186 


2. Show that is nonplanar by the method in Example 2. 

3. Which of the following graphs are planar? Find ^33 or K 5 configurations in the 


nonplanar graphs (almost all are AT 33) . 

(a) (b) (c) (d) 





4. Redraw the graph in Figure 1.196 so that the infinite region is bounded by the 
circuit a-f-c-h-a. 

5. (a) For what values of n is K„ planar? 

(b) For what values of r and s is the complete bipartite graph K rs planar? (K r s 
is a bipartite graph with r vertices on the left side and s vertices on the right 
side and edges between all pairs of left and right vertices.) 
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6. A complete tripartite K r s t is a generalization of a complete bipartite graph (see 
part (b) of the previous exercise). There are three subsets of vertices: r in the 
first subset, 5 in the second subset, and t in the third subset. Every vertex in one 
particular subset is adjacent to every vertex in the other two subsets; that is, a 
vertex is adjacent to all vertices except those in its own subset. Determine all the 
triples r, s, t for which K r s J is planar. 

7. In each case, give the values of r, e, or v (whichever is not given) assuming that 
the graph is planar. Then either draw a connected, planar graph with the property, 
if possible, or explain why no such planar graph can exist. 

(g) Six regions all with four boundary 
edges 

(h) Seven vertices all of degree 3 

(i) 12 vertices and every region has 
four boundary edges 

(j) 17 regions and every vertex has 
degree 5 

8. If a connected planar graph with n vertices all of degree 4 has 10 regions, deter- 
mine n. 

9. A line graph L( G ) of a graph G has a vertex of L(G) for each edge of G and an 
edge of L(G) joining each pair of vertices corresponding to two edges in G with 
a common end vertex. 

(a) Show that L(K^) is nonplanar. 

(b) Find a planar graph whose line graph is nonplanar. 

10 . The construction of a dual D(G ) can be applied to any plane graph G: draw a 
vertex of D(G) in the middle of each region of G and draw an edge e* of D(G ) 
perpendicular to each edge e of G; e* connects the vertices of D(G) representing 
the regions on either side of e. 

(a) A dual need not be a graph. It might have two edges between the same pair of 
vertices or a self-loop edge (from a vertex to itself). Find two planar graphs 
with duals that are not graphs because they contain these two forbidden 
situations. 

(b) Show that the duals of the two different plane depictions of the graph in 
Figures 1.23a and 1.23c are isomorphic. 

(c) Show that the degree of a vertex in the dual graph D(G ) equals the number 
of boundary edges of the corresponding region in the planar graph G. 

(d) Find a planar graph that is isomorphic to its own dual. 

(e) Show for any plane depiction of a graph G that the vertices of G correspond 
to regions in D(G). 


(a) Six vertices and seven regions 

(b) Eight vertices and 13 edges 

(c) Six vertices and 14 edges 

(d) 14 edges and nine regions 

(e) Six vertices all of degree 4 

(f) Five regions and 10 edges 


11. (a) Show that if a circuit in a planar graph encloses exactly two regions, each of 
which has an even number of boundary edges, then the circuit has even length. 
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(b) Show that if a circuit in a planar graph encloses a collection of regions, each of 
which has an even number of boundary edges, then the circuit has even length. 

12. The crossing number c(G) of a graph G is the minimum number of pairs of 
crossing edges in a depiction of G. For example, if G is planar, then c(G ) = 0. 
Determine c(G) for the following graphs: 

(a) Ki'i (b) K$ (c) Figure 1.22a 

13. A graph G is critical nonplanar if G is nonplanar but any subgraph obtained by 
removing a vertex is planar. 

(a) Which of the following graphs are critical nonplanar? 

(i) K 3 i (ii) K $ (iii) Figure 1.22a 

(b) Show that critical nonplanar graphs must be connected and cannot have a 
vertex whose removal disconnects the graph. 

14. A graph G is called maximal planar if G is planar but the addition of another 
edge between two nonadjacent vertices will make the graph nonplanar. 

(a) Show that every region of a connected, maximal planar graph will be trian- 
gular. 

(b) If a connected, maximal planar graph has n vertices, how many regions and 
edges does it have? 

15. Suppose G is a planar graph that is not necessarily connected, as required in 
Euler’s formula (Theorem 2). Recall that a component H is a connected subgraph 
of G with the property that there is no path between any vertex in H and any 
vertex of G not in H. 

(a) Find the appropriate modification of Euler’s formula for a planar graph with 
c components. 

(b) Show that the corollary is valid for unconnected planar graphs. 

16. Prove that if a graph G has 1 1 vertices, then either G or its complement G must be 
nonplanar. (Hint: Determine the total number N\ i of edges in a complete graph 
on 1 1 vertices; if the result were false and G and its complement were each planar, 
how many of the An edges could be in each of these two graphs?) 

17. Mimic the argument in the corollary to prove that e < 3r — 6 in a planar graph 
with each vertex of degree >3. 

18. (a) Prove that every connected planar graph has a vertex of degree at most 5. 

[Hint: Assume that every vertex has degree at least 6 and obtain a contradic- 
tion as follows. Get a lower bound, involving v, on the sum of the degrees 
of vertices (similar to the lower bound on the sum of the degrees of regions 
obtained in the proof of the corollary). Since this sum of degrees equals 2e, 
the corollary’s bound on e can be used to get an upper bound, also involving 
v, on this sum of degrees. Combining these two bounds yields the desired 
contradiction.] 
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(b) Show that part (a) immediately generalizes to any (unconnected) planar 
graph. 

19. Prove that every connected planar graph with less than 1 2 vertices has a vertex of 
degree at most 4. [Hint: Assume that every vertex has degree at least 5 to obtain 
a lower bound on e (together with the upper bound on e in the corollary) that 
implies v > 12.] 

20. If G is a connected planar graph with all circuits of length at least k, show that the 
inequality e < 3v — 6 can be strengthened to e < ^(v — 2). (Hint: The degree 
of a region will be at least k .) 

21. (a) Show that every circuit in the graph in Exercise 3(1) has at least five edges, 
(b) Use part (a) and the result of Exercise 20 to show that this graph is nonplanar. 

22. (a) Give an example of a graph with regions consisting solely of squares (regions 

bounded by four edges) and hexagons, and with vertices of degree at least 3. 

(b) Write an expression for the sum of the degrees of the vertices (=2e) in such 
a graph in terms of v and s, the number of squares. Then use Exercise 17 to 
get an upper bound on 2e. Deduce that any graph of the sort defined in part 
(a) has at least six squares. 

(c) If each vertex has degree 3, show that any graph of the sort defined in part 
(a) has exactly six squares. 

23. If G is a connected planar graph where e = 3v — 6, show that every region is 
triangular (has three boundary edges). 

24. A Platonic graph is a planar graph in which all vertices have the same degree d\ 
and all regions have the same number of bounding edges di, where d i > 3 and 
d 2 > 3. A Platonic graph is the “skeleton” of a Platonic solid, for example, an 
octahedron. 

(a) If G is a Platonic graph with vertex and face degrees cl\ and d 2 , respectively, 
then show that e = ^d\ \ and r = (d\/d 2 )\. 

(b) Using part (a) and Euler’s formula, show that \(2di + 2<r/o — c/kT) = 442. 

(c) Since v and 4 A are positive integers, we conclude from part (b) that 2d] + 
2d 2 — d\d 2 > 0. Use this inequality to prove that (d\ — 2)(d2 — 2) < 4. 

(d) From part (c), find the five possible pairs of positive (integral) values of d\, cT. 

25. Suppose that / lines are drawn through a circle and these lines form p points of 
intersection (involving exactly two lines at each intersection). How many regions 
r are formed inside the circle by these lines? Assume that the lines end at the 
edge of the circle at 21 distinct points. 

26. Consider an overlapping set of four circles A,B,C, D. One would like to position 
the circles so that every possible subset of the circles forms a region, e.g., four 
regions each contained in just one (different) circle, six regions formed by the 
intersection of two circles ( AB , AC, AD, BC, BD, CD), four regions formed by the 
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intersection of three of the four circles, and one region formed by the intersection 
of all four circles. Prove that it is not possible to have such a set of 15 bounded 
regions. 

27. Show that the following graphs can be drawn on the surface of a doughnut (torus) 
without crossing edges: 


(a) K 3,3 


(b) K 5 


(c) K 6 




1.5 SUMMARY AND REFERENCES 


This chapter introduced graphs, their applications, and some of their basic structures. 
This text takes a user-oriented approach to graph theory. Readers interested in a 
more formal graph theory text presenting the subject as an interesting area of pure 
mathematics should see the books by Bondy and Murty [1], West [3], or Wilson [4J. 
Section 1 . 1 introduced a set of illustrative graph models. The basic structure of graphs 
was explored in Section 1.2 under the guise of determining what makes two graphs 
different. Section 1.3 presented some useful edge-counting results. The final section 
introduced the important class of planar graphs. It surveyed ad hoc and theoretical 
approaches for determining whether a graph is planar. 

The history of graph theory begins with the work of L. Euler in 1736 on Euler 
cycles (discussed in Section 2.1). Euler’s formula for planar graphs, originally stated in 
terms of polyhedra, was proved in 1752. Bits of graph theory appeared in papers about 
topology and geometric games, but it was not until around 1850 that formal studies 
of graphs began to appear. One was A. Cayley’s 1857 paper counting the number of 
trees (discussed in Chapter 3). Another was G. Kirchhoff’s 1847 paper presenting 
an algebra of circuits and introducing graphs in the study of electrical circuits. This 
same paper contains Kirchhoff’s famous current and voltage laws (Kirchhoff was 21 
when he wrote this historic paper). The term graph was first used by J. Sylvester in 
1877. The first book on graph theory, by D. Konig, did not appear until 1936. An 
excellent sourcebook on the history of graph theory is Graph Theory 1736-1936 by 
Biggs, Lloyd, and Wilson [2], 

See the General References (at the end of the book) for a list of other introductory 
texts on graph theory. 

1. J. A. Bondy and U. S. R. Murty, Graph Theory with Applications, American 

Elsevier, New York, 1976. 

2. N. Biggs, E. Lloyd, and R. Wilson, Graph Theory 1736-1936, Cambridge 

University Press, Cambridge, 1999. 


3. D. West, Introduction to Graph Theory, 2nd ed., Prentice-Hall, Saddle River, 
N. J., 2001 

4 . R. Wilson, Introduction to Graph Theory, 4th ed., John Wiley and Sons, New 
York, 1997. 
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SUPPLEMENTARY EXERCISES 


Summary of Exercises Graph theory is a field famous for its in- 
teresting problems. Several exercises introduce new graph concepts, such as strong 
connectedness (Exercise 14) and cut-set (Exercise 25). Exercise 30 is a very famous 
problem in Ramsey theory (see Appendix A. 4). For more problems in graph the- 
ory, see any of the graph theory texts listed in the References at the end of this 
text. 

1. Suppose that there are seven committees with each pair of committees having a 
common member and each person being on two committees. How many people 
are there? 

2. Show that the complement of K„, a complete graph on n vertices, is a set of n 
isolated vertices. 

3. A graph is regular if all vertices have the same degree. If a graph with n vertices 
is regular of degree 3 and has 18 edges, determine n. 

4. If a graph has 50 edges, what is the least number of vertices it can have? 

5. Show that at least two vertices have the same degree in any graph with at least 
two vertices. {Hint: Be careful about vertices of degree 0.) 

6. Show that an undirected graph with all vertices of degree > 2 must contain a 
circuit (edges cannot be repeated in a circuit). 

7. If every vertex in the graph G has degree 2, does every vertex lie on a circuit? 
Prove, or give a counterexample. 

8. If every vertex in a graph G has degree > cl, then show that G must contain a 
circuit of length at least d + 1 . 

9. If every vertex in a directed graph G has positive out-degree (at least one out- 
wardly directed edge), 

(a) Must G contain a directed circuit? 

(b) Must every vertex of G be on a directed circuit? 

10. If G is a connected graph that is not a complete graph, show that some vertex, 
call it x, has two neighbors, call them y, z, that are not adjacent to each other [that 
is, there are edges (x, y) and (x, z ) but not edge (v, z)]. 

11. Show that if a graph is not connected, then its complement must be connected. 

12. Show that removal of some vertex x disconnects the connected undirected graph 
G if and only if there are two vertices a and b in G such that all paths in G from 
a to b pass through x. 

13. Let G be a connected graph such that G-x is not connected for all but two vertices 
x of G. Show that G is a path. 

14. A directed graph G is called strongly connected if there is a directed path from 
x to y for any two vertices x, y in G. Direct the edges in the following graphs to 
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make the graphs strongly connected. If not possible, explain why. (An application 
of this problem is making streets in a city one-way.) 

(a) Figure 1.4 (b) Figure 1.2 

15 . Prove that G is strongly connected (see Exercise 14) if and only if G’s vertices 
cannot be partitioned into two sets Vi , \f such that there are no edges from a 
vertex in Vi to a vertex in VT 

16 . A bridge is an edge in a connected graph whose removal disconnects the graph. 
Show that if a graph G contains a bridge, then it cannot have a circuit that contains 
all vertices of G. 

17 . Show that if every edge in a connected graph G lies on a circuit, then G cannot 
have a bridge (a bridge is defined in Exercise 16). 

18 . Prove that the edges of a connected undirected graph G can be directed to create 
a strongly connected graph (see Exercise 14) if and only if there is no bridge in 
G (see Exercise 16). 

19 . Draw planar graphs with the following types of vertices, if possible: 

(a) Six vertices of degree 3 

(b) i vertices of degree i, i— 1, 2, 3, 4, 5 

(c) Two vertices of degree 3 and four vertices of degree 5 

20 . Let I be a set of independent vertices in a graph G and let C be a set of vertices 
that forms a complete subgraph. Show that I and C have at most one vertex in 
common. 

21 . Is it possible to have a connected simple graph containing six vertices with 
degrees: 5, 2, 2, 2, 2, 1? If so, draw such a graph. 

22 . Mr. Megabucks invites three married couples to his penthouse for dinner. Upon 
arrival, Mr. Megabucks and the six guests shake the hands of some of the other 
people (none of the guests shakes hands with his or her own spouse). Suppose 
each of the six guests shakes a different number of hands (possibly one person 
shakes no hands). By building a graph model of this situation (with seven ver- 
tices for the six guests and Mr. Megabucks), determine exactly how many hands 
Mr. Megabucks must have shaken. {Hint: First determine the six different num- 
bers of hands shaken by the different guests.) 

23 . A round-robin tournament can be represented by a complete directed graph with 
vertices for competitors and an edge ( a P, b) if a beats b. Each competitor plays 
every other competitor once. A vertex’s (competitor’s) score will be its out-degree 
(number of victories). Show that if vertex x has a maximum score among vertices 
in a round-robin tournament, then for any other vertex y, either there is an edge 
(xp y ) or for some w there are edges (xp w) and (wp y). 

24 . Suppose the round-robin tournament graph (see Exercise 23) has no directed 
circuits. We define a ranking of vertices (competitors) as follows. A vertex with 
no outward edge has a rank of 0. In general, a vertex has rank k if it has an 
edge directed to a rank k — 1 vertex and all other edges directed to lower ranks 
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(< k — 1). Show that a directed complete graph with no directed circuits always 
has such a ranking, and that each vertex will have a different rank. 

25 . A trail is a sequence of vertices with consecutive vertices joined by a distinct 
edge (no edge can be repeated). Unlike a path, a vertex can be visited any number 
of times in a trail. A cycle is a trail that starts and ends at the same vertex. A cycle 
that repeats no vertices is a circuit. 

(a) Show that a subset of the vertices on a trail from x to y can be used to make 
a path from x to y. 

(b) Prove, or give a counterexample: If x and y lie on a cycle, then they must lie 
on a circuit. 

(c) Show that the edges in a cycle can be partitioned into a collection of circuits. 

(d) Show that if C is an odd-length cycle, then a subset of C’s edges forms an 
odd-length circuit. 

26 . Consider a collection of circles (of varying sizes) in the plane. Make a circle 
graph with a vertex for each circle and an edge between two vertices when they 
correspond to two circles that cross (if one circle properly contains another, there 
would be no edge). 

(a) Draw a family of circles whose circle graph is isomorphic to K 4 . 

(b) Draw a family of circles whose circle graph is the graph in Figure 1.3 
(ignoring edge directions). 

(c) Draw a family of circles whose circle graph is isomorphic to K 3,3. 

27 . A cut-set S is a set of edges in a connected undirected graph G whose removal 
disconnects G, but such that no proper subset of S can disconnect G. 

(a) Find a cut-set of minimal size in Figure 1.22a. 

(b) Show that every cut-set has an even number of edges in common with any 
circuit (remember that 0 is an even number). 

28 . A graph with n vertices and n + 2 edges must contain two edge-disjoint circuits. 
Prove or give a counterexample. 

29 . Show that if an n-vertex graph has more than \(n — l)(n — 2) edges, then it must 
be connected. {Hint: The most edges possible in a disconnected graph will occur 
when there are two components, each complete subgraphs.) 

30 . Show that an n-vertex graph cannot be a bipartite graph if it has more than ^n 2 
edges. 

31 . Suppose that G is a connected graph containing no triangles and that every pair 
of two non-adjacent vertices in G has exactly two neighbors in common. Show 
that every vertex of G must have the same degree. {Hint: Show that any pair of 
adjacent vertices must have the same degree.) 

32 . {Famous Ramsey theory problem ) Let each edge of a complete graph on six 
vertices be painted red or white. Show that there must always be either a red 
triangle of three edges or a white triangle of three edges. 
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33. (a) Find a graph that is isomorphic to its own complement. 

(b) Show that any self-complementary graph [as in part (a)] must have either 4k 
or 4k + 1 vertices, for some integer k. {Hint: Use the fact that G and G both 
must have the same number of edges.) 

34. Suppose that each path in a certain 7-vertex planar graph contains an even 
number of edges (zero edge or two edges or four edges, etc.). Draw the graph. 
{Hint: This is a “trick” problem.) 

35. Show that a directed graph has no directed circuits if and only if its vertices can 
be indexed xi, Xi, ■ ■ . , x n , so that all edges are of the form (xf, x ; ), i < j . 

36. Let K m n be a complete bipartite graph, with m > n. What is the size of the 
smallest edge cover of K mn 2 What is the size of the largest independent set? 

37. A line graph L{G ) of a graph G has a vertex of L < G ) for each edge in G and 
an edge between two vertices in L(G) corresponding to two edges of G with a 
common end vertex. 

(a) Draw a line graph of the left graph in Figure 1.6. 

(b) Show that each vertex in L(K n ) has degree 2{n — 2). 

(c) Find all graphs that are isomorphic to their own line graph. 

38. Show that if a graph H is the line graph (see Exercise 37) of some graph, then 
the edges of H can be partitioned into a collection of complete subgraphs such 
that each vertex of H is in exactly two such complete subgraphs. 

39. An automorphism of a graph is an isomorphism (1 — 1 mapping preserving 
adjacency) of the vertices of a graph with themselves. Find an automorphism of 
the graph in 

(a) Figure 1.1a (b) Figure 1.4 (c) Figure 1.13 

40. (a) Show that there is no way to pair off the 14 vertices in the graph below with 

seven edges. 

(b) Generalize part (a) to the problem of trying to use 31 dominoes to cover 
the 62 squares of an 8 x 8 chessboard with its two opposite corner squares 
removed. 


41. Suppose circuits C\ and C 2 have common edges (but C\ f Cf). Show that the 
edges in (Ci , U Cf) — (C 1 , fl C 2 ) form a circuit (or collection of circuits). 

42. If the graph G has 2n vertices and no triangles, then show that G cannot have 
more than n 2 edges. 


CHAPTER 2 

COVERING CIRCUITS 
AND GRAPH COLORING 



EULER CYCLES 


In this chapter we examine two graph-theoretic concepts that have many important 
applications. One is edge sequences, which visit either all the edges in a graph once or 
all the vertices once. The other is graph coloring, a concept introduced in Example 1 
of Section 1.4. 

In some applications, such as a highway network linking a group of cities, it is 
natural to permit several edges to join the same pair of vertices. Such generalized 
graphs are called multigraphs; loop edges, of the form (a, a), are also allowed in 
multigraphs. Figure 2.1 b displays a multigraph. 

At the beginning of Chapter 1, we defined a path P — x \ —X 2 — — x n to be a 
sequence of distinct vertices with each pair of consecutive vertices in P joined by an 
edge. When there is also an edge ( x„ , X \ ), the sequence is called a circuit, written 
x i —X 2 — ■ —x„-X] . Just as we generalized our notion of a graph to a multigraph, now 
we generalize our definitions of path and circuit in order to allow repeated vertices. A 
trail T = x \ -X 2 — ■ —x n is a sequence of vertices (not necessarily distinct) in which, 
like a path, consecutive vertices are joined by an edge. However, no edge can be 
repeated in a trail. In Figure 2.1 b, A-B-D-B-C is a trail (we assume that segments 
B-D and D-B on this trail are using the two different edges joining B and D). When 
there is also an edge ( x n ,x \ ), the sequence of vertices is called a cycle, written 
X | —X2~ ■ — x n — X | . 


Example 1: Konigsberg Bridges 

The old Prussian city of Konigsberg was located on the banks of the Pregel River. 
Part of the city was on two islands that were joined to the banks and each other 
by seven bridges, as shown in Figure 2.1a. The townspeople liked to take walks, or 
Spaziergangen, about the town across the bridges. Several people were apparently 
bothered by the fact that no one could figure out a walk that crossed each bridge 
just once, for they brought this problem to the attention of the famous mathemati- 
cian Leonhard Euler. He solved the Spaziergangen problem, thereby giving birth to 
graph theory and immortalizing the Seven Bridges of Konigsberg in mathematics 
texts. 
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(b) 


We can model this walk problem with a multigraph having a vertex for each 
body of land and an edge for each bridge. See Figure 2.1/?. The desired type of walk 
corresponds to what we now call an Euler cycle. An Euler cycle is a cycle that contains 
all the edges in a graph (and visits each vertex at least once). Readers should convince 
themselves that no Euler cycle exists for this multigraph. ■ 

A multigraph possessing an Euler cycle will have to have an even degree at each 
vertex, since each time the cycle passes through a vertex it uses two edges. A second 
obvious property is that the multigraph must be connected. Euler showed that these 
two properties are also sufficient to guarantee the existence of an Euler cycle. Before 
proving this theorem, let us first build an Euler cycle by ad hoc methods in a graph 
that is connected and has even-degree vertices. Then we extend our construction to a 
general proof of the Euler cycle theorem. 

Observe first that the even-degree condition is so powerful that it creates an 
apparent paradox. Suppose we start at some vertex a in a graph G with all even- 
degree vertices and start building an Euler cycle by tracing out an arbitrary sequence 
of edges (without repeating any edges). The assumption of all even degrees implies 
not only that there will be an even number of edges the first time we come to any 
vertex, but also that there will be even number of unused edges after we leave that 
vertex, since entering and leaving a vertex uses two edges. This in turn implies that 
every time we come to that vertex again, it will have an even number of unused edges. 
We can only be forced to stop if we enter a vertex with just one unused edge (the edge 
by which we enter the vertex), but we just showed that this can never happen. So it 
appears that we are able to continue our sequence forever, even though any graph has 
only a finite number of edges. The paradox is resolved by noting that when we started 
off from a (leaving it without first having entered it), a became an odd-degree vertex 
and so our sequence of edges must eventually end at a. 

Example 2: Building an Euler Cycle 

Build an Euler cycle for the graph in Figure 2.2 a that is connected and has even-degree 
vertices. 

Let us start by blindly tracing out a trail from vertex a. Recall that a trail is like 
a path, except that vertices may be repeated. Suppose we go a-d-j-n-o-k-l-h-f-e-b-a. 
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Now we are back to a. Note that the even degree property means that we can always 
leave any vertex we enter except a. There are no vertices of degree 1 or that are 
reduced to degree 1 after being visited several times. Thus, any trail we trace from a 
will never be forced to end at another vertex and so must eventually come back to a, 
forming a cycle; in this case the cycle is a circuit (no vertex is visited twice). 

Next consider the graph of remaining edges, shown in Figure 2.2 b. It is no longer 
connected, but all vertices still have even degree (removing the cycle reduced each 
degree by an even amount). Each connected piece of the remaining graph has an Euler 
cycle: d-c-i-j-k-e-d and h-g-m-h. These two cycles can be inserted into the original 
cycle at vertices d and /?, respectively, yielding the Euler cycle a-d-c-i-j-k-e-d-j-n-o- 
k-l-h-g-m-h-f-e-b-a. ■ 


Theorem 1 (Euler, 1736) 

A multigraph has an Euler cycle if and only if it is connected and has all vertices of 
even degree. 


Proof 

The proof generalizes the construction in Example 2. As noted above, when an Euler 
cycle exists, the multigraph must be connected and have all vertices of even degree. 

Suppose a multigraph G satisfies these two conditions. Then pick any vertex a 
and trace out a trail. As in Example 2, the even-degree condition means that we are 
never forced to stop at some other vertex, and so eventually the trail must terminate at 
a (possibly it passes through a several times before finally being forced to stop at a). 

Recall that upon leaving a at the start, a now has odd degree. This is why we 
are eventually forced to stop at a. Let C be the cycle thus generated and let G' be the 
multigraph consisting of the remaining edges of G-C. As in Example 2, G' may not 
be connected but it must have all vertices of even degree. Since the original graph 
was connected, C and G' must have a common vertex or else there is no path from 
vertices in C to vertices in G' . Let a' be such a common vertex. Now build a cycle 
C' tracing through G' from a' just as C was traced in G from a. Incorporate C' into 
the cycle C at a', as done in Example 2, to obtain a new larger cycle C'. Repeat this 
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process by tracing a cycle in the graph G" of still remaining edges and incorporate 
the cycle into C' to obtain C". Continue until there are no remaining edges. ♦ 

Let us now consider an application of Euler cycles to an urban systems problem. 

Example 3: Routing Street Sweepers 

Suppose the graph of solid edges in Figure 2.3 represents a collection of blocks to be 
swept by a street sweeper in a certain district of some city from 10 A.M. to 1 1 A.M. 
(when parking on these blocks is forbidden). The looping edge at k represents a circular 
street. We want a tour that sweeps each solid edge once. That is, we want an Euler 
cycle of the solid edges. Unfortunately, the graph of edges to be swept in such applica- 
tions rarely has all vertices of even degree. Frequently the graph is also not connected. 
In such cases we must traverse extra edges, called deadheading edges, to obtain the 
desired tour. This creates a new problem: minimizing the number of deadheading 
edges. 

The desired tour will be an Euler cycle of the multigraph of sweeping and dead- 
heading edges — a multigraph because some edges may be repeated in deadheading. 
By the theorem, this multigraph must be connected and have all vertices of even 
degree. Thus to minimize deadheading, the deadheading edges should be a minimal 
set of extra edges with the property that when added to the original graph all vertices 
have even degree and the new multigraph is connected. Suppose the dashed edges in 
Figure 2.3 are such a minimal set of extra edges. 

There is one more problem to be faced once we have a multigraph possessing an 
Euler cycle. We want to build an Euler cycle that minimizes turns at corners, especially 
U-turns. Even right-hand turns can tie up traffic in busy cities. We can successively 
look at each corner (vertex) and pair off the edges at the corner so as to minimize 
disruption on the tour’s passes through that corner. For the multigraph in Figure 2.3, 
we would go straight through corners d, e, h, i on visits to these corners; we would 
go straight through j once and turn once [going between (/, j) and (k, /)]; we must 
turn both times at k; and at all other corners (of degree 2) we would have forced turns. 
The short lines near each vertex in Figure 2.3 indicate these edge pairings. Of course, 
such edge pairings are very unlikely to produce a single Euler cycle. In Figure 2.3, we 
get two cycles from these pairings: a-b-e-i-m-l-h-d-a and c-d-e-f-j-k-k-j-i-h-g-c. Now 
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we break the two cycles at a common vertex and fuse them together. For example, at 
d, change the edge pairings to 



d 



The result is an Euler cycle with optimal pairings at all but one corner. 

A large street network with 200 edges typically forms only three or four cycles 
when minimal-disruption edge pairing is performed at each corner, and so only a 
few pairings need to be changed. In practice, it is also necessary to use directed 
graphs since street sweepers cannot move against the traffic on their side (curb) of the 
street. ■ 

It is interesting to note that the street sweeping problem gives rise to an alternative 
proof of the Euler cycle theorem as follows. 

Alternative Proof of Theorem 

The even-degree condition means that we can pair off (and link together) the edges 
at each vertex. These linked edges form a set of cycles. The connectedness condition 
means that these cycles have common vertices where they can be joined to form a 
single cycle — an Euler cycle. ♦ 

This proof is not as intuitive or pictorial as our original path-tracing method, but 
it is both simpler and more applicable. 

We conclude this section by extending the concept of an Euler cycle to an Euler 
trail. An Euler trail is a trail that contains all the edges in a graph (and visits each 
vertex at least once). 

Corollary 

A multigraph has an Euler trail, but not an Euler cycle, if and only if it is connected 
and has exactly two vertices of odd degree. 

Proof 

Suppose a multigraph G has an Euler trail but not an Euler cycle. Call it T. Then the 
starting and ending vertices of T must have odd degree while all other vertices have 
even degree (by the same reasoning that showed all vertices have even degree in an 
Euler cycle). Also the graph must be connected. 

On the other hand, suppose that a multigraph G is connected and has exactly 
two vertices, p and q, of odd degree. Add a supplementary edge ( p, q) to G to obtain 
the graph G' . G' is connected and has all vertices of even degree. Hence by the Euler 
cycle theorem, G' has an Euler cycle, call it C. Now remove the edge ( p, q) from C. 
This removal reduces the Euler cycle to an Euler trail that includes all edges of G. ♦ 
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2.1 EXERCISES 


Summary of Exercises The first four exercises involve trying to 
build Euler cycles. Exercises 5-8 and 1 1-14 present extensions and other questions 
related to the Euler cycle theorem. The remaining questions involve some modeling 
and further theory, and the last two exercises ask for computer programs. 


1. (a) Build an Euler cycle for the right graph in Figure 1.8. 

(b) Build an Euler trail for the graph in Figure 1.22a with edge ( d , g) removed. 

2. (a) For which values of n does K n , the complete graph on n vertices, have an 

Euler cycle? 

(b) Are there any K n that have Euler trails but not Euler cycles? 

(c) For which values of r and .s' does the complete bipartite graph K rs have an 
Euler cycle? 

3. Find a graph G with seven vertices such that G and its complement both have an 
Euler cycle. 

4. What is the minimum number of times one must raise one’s pencil in order to 
draw the graph in Figure 1.4? 

5. (a) Can a graph with an Euler cycle have a bridge (an edge whose removal 

disconnects the graph)? Prove or give a counterexample. 

(b) Give an example of a 10-edge graph with an Euler trail that has a bridge. 

6. Give an argument for and an argument against the statement that a 1 -vertex graph 
(with no edges) has an Euler cycle. 

7. Suppose that in the definition of an Euler cycle, we drop the seemingly superfluous 
requirement that the Euler cycle visit every vertex and require only that the cycle 
include every edge. Show that now the theorem is false. Draw a graph that 
illustrates why the theorem is now false. 

8. Prove that if a connected graph has a 2k vertices of odd degree, then there are k 
disjoint trails that contain all the edges. 

9. The matrix below marks with a 1 each pair of the set of racers A, B, C, D, E, F 
who are to have a drag race together. It is most efficient if a racer can run in two 
races in a row (but not in three in a row). Is it possible to design a sequence of 
races such that one of the racers in each race (except the last race) also runs in the 
following race (but not three in a row)? If possible, give the sequences of races 
(pairs of racers); if not, explain why not. 


A B C D E F 


A 

B 

C 

D 

E 

F 


— 10 110 
1—110 1 
0 1 — 010 
1 10 — 11 
1 0 11 — 1 
0 10 11 — 
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10 . Is it possible for a knight to move around an 8 x 8 chessboard so that it makes 
every possible move exactly once (consider a move between two squares con- 
nected by a knight to be completed when the move is made in either direction)? 

11 . Show that in Example 3 the minimum set of deadheading edges in any sweeping 
problem will be a collection of edges forming paths between pairs of different 
odd-degree vertices. 

12 . Prove the directed version of the Euler cycle theorem: a directed multigraph has 
a directed Euler cycle if and only if the multigraph is connected (when directions 
are ignored) and the in-degree equals the out-degree at each vertex. 

(a) Model your proof after the argument in the proof of the theorem. 

(b) Model your proof after the argument in the alternative proof of the theorem. 

(c) Build a directed Euler cycle for the graph below. 

a b c d e 



13 . State and prove a directed multigraph version of the corollary. 

14 . A directed graph is called strongly connected if there is a directed path from 
any given vertex to any other vertex. Show that if a directed graph possesses a 
directed Euler cycle, then it must be strongly connected. 

15 . Try to find a minimal set of edges in the graph below whose removal produces 
an Euler cycle. {Hint: Tricky.) 


a 



16 . The line graph L{G) of a graph G has a vertex for each edge of G, and two of 
these vertices are adjacent if and only if the corresponding edges in G have a 
common end vertex. 

(a) Show that L{G) has an Euler cycle if G has an Euler cycle. 

(b) Find a graph G that has no Euler cycle but for which L(G) has an Euler cycle. 

17 . Consider the following algorithm due to Fleury for building an Euler cycle, 
when one exists, in a single pass through a graph (without later adding side 
cycles as in the proof of the theorem). Starting at a chosen vertex a, build a 
cycle and erase edges after they are used (also erase vertices when they become 
isolated points). The one rule to follow in the cycle building is that one never 
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chooses an edge whose erasure will disconnect the resulting graph of remaining 
edges. 

(a) Apply this algorithm to build Euler cycles for the graph in 

(i) Figure 2.2a (ii) Figure 2.3 (including deadheading edges) 

(b) Prove that this algorithm works. 

(c) Does this algorithm work for Euler trails? Explain. 

18 . Suppose we are given an undirected connected graph representing a network of 
two-way streets. 

(a) Show that there always exists a tour of the network in which a person drives 
along each side of every street once. 

(b) Show that the tour in part (a) can be generated by the following rule: at any 
intersection, do not leave by the street first used to reach this intersection 
unless all other streets from the intersection have been used. 

19 . A set of eight binary digits (0 or 1 ) are equally spaced about the edge of a disk. 
We want to choose the digits so that they form a circular sequence in which every 
subsequence of length three is different. Model this problem with a graph with 
four vertices, one for each different subsequence of two binary digits. Make a 
directed edge for each subsequence of three digits whose origin is the vertex with 
the first two digits of the edge’s subsequence and whose terminus is the vertex 
with the last two digits of the edge’s subsequence. 

(a) Build this graph. 

(b) Show how an Euler cycle (which exists for this graph) will correspond to the 
desired 8-digit circular sequence. 

(c) Find such an 8-digit circular sequence with this graph model. 

(d) Repeat the problem for 4-digit binary sequences. 

20 . Write computer programs for finding an Euler cycle, when one exists, in a multi- 
graph: 

(a) Use the method in the proof of the theorem. 

(b) Repeat part (a) for a directed graph. 

(c) Use the method in the alternative proof of the theorem. 


21 . Write a program to implement the algorithm in Exercise 17. 



In this section we explore Hamilton circuits and paths, circuits and paths that visit 
each vertex in a graph exactly once. Hamilton circuits arise in operations research 
problems such as routing a delivery truck that must visit a set of stores. In these 
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applications, the most efficient solution will be obtained by finding a minimal-cost 
Hamilton circuit (this problem is discussed in Section 3.4). 

The problem of determining whether a graph has an Euler cycle was answered 
by the Euler cycle theorem, which tells us that a graph has an Euler cycle if and 
only if all vertices have even degree and the graph is connected. Such a nice, sim- 
ple answer is very unusual in graph theory (which is probably why the Euler cycle 
theorem was the first result proved in the field of graph theory). In this sec- 
tion, we return to graph theory normalcy: There is no simple way to determine 
whether or not an arbitrary graph has a Hamilton circuit or a Hamilton path. In- 
deed, finding a Hamilton circuit or path in a graph is an NP-complete problem (see 
Appendix A. 5). 

Finding Hamilton circuits by inspection, when they exist, is usually not too hard 
in moderate-sized graphs, but proving that no Hamilton circuit exists in a given graph 
can be very difficult. Such a proof typically involves the same type of reasoning with 
the AC Principle needed to show that two graphs are not isomorphic or that a graph 
is nonplanar. At the end of this section we present a sampling of the theory that has 
been developed about the existence of Hamilton circuits. These theorems give various 
special conditions on a graph that guarantee the existence of a Hamilton circuit. Most 
graphs satisfy none of these conditions. 

Our focus will be proving that a Hamilton circuit does not exist in particular 
graphs. This nonexistence problem requires the type of systematic logical analysis 
that is the essence of most applied graph theory. To prove nonexistence, we must 
begin building parts of a Hamilton circuit and then show that the construction must 
always fail. This is similar to the way in Section 1.4 that we showed a graph to be 
nonplanar by building a circuit and then adding chords in a structured fashion that 
forced two edges to cross. 

Our analysis uses three simple rules that must be satisfied by the set of edges used 
to build a Hamilton circuit. The first step in applying the AC Principle ( Assumptions 
generate helpful Consequences ) is that if a graph has a Hamilton circuit, then any 
such Hamilton circuit must contain exactly two edges incident to each vertex. From 
this consequence three further consequences follow. 

Rule 1. If a vertex x has degree 2, both of the edges incident to x must be 
part of any Hamilton circuit. 

Rule 2. No proper subcircuit — that is, a circuit not containing all 
vertices — can be formed when building a Hamilton circuit. 

Rule 3. Once the Hamilton circuit is required to use two edges at a vertex 
x, all other (unused) edges incident at x must be removed from 
consideration. 

Example 1: Nonexistence of Hamilton Circuit I 

Show that the graph in Figure 2.4 has no Hamilton circuit. 

We can apply Rule 1 at vertices a, b, d, and e. To indicate that the two edges at 
each of these vertices must be used, we draw a little line segment from one edge to 
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Figure 2.4 d 

the other, as shown in Figure 2.4. There are two types of contradictions that have now 
arisen. First, using Rule 1 for vertices a and d forces the Hamilton circuit to use edges 
(a, c), (a, d), (d, c) in forming a triangle; this violates Rule 2. Second, using Rule 1 
for all four vertices of degree 2 forces the Hamilton circuit to contain more than two 
edges incident at c. Either of these difficulties proves that this graph has no Hamilton 
circuit. ■ 

In the next two examples, our three rules will be used in the following sort of 
sequential reasoning. When Rule 3 requires us to delete edges at some vertex q, one 
of q’ s deleted edges may go to a vertex r of degree 3. After the deletion of (q, r), r has 
degree 2, requiring by Rule 1 that its remaining two edges be used. As certain edges 
are forced to be deleted, other edges are forced to be used, which in turn forces still 
other edges to be deleted, and so on. 

Example 2: Nonexistence of Hamilton Circuit II 

Show that the graph in Figure 2.5 has no Hamilton circuit. 

We can apply Rule 1 at vertices a and g, so that the subpaths b-a-c and e-g-i 
must be part of any Hamilton circuit. Next consider vertex i. We already know that 
(g, i) must be part of the circuit. Since the graph is symmetric with respect to edges 
( i , j ) and (i, k), it does not matter which of these two edges we choose as the other 
edge incident to i on the Hamilton circuit. Suppose we pick (/, j). If we obtain a 
contradiction using ( i , j), then by symmetry we would also obtain a contradiction 
with ( i , k). The situation can be visualized as follows: if we marked (i, j) and then 
held a page with the graph in front of us (facing away from us) and viewed it in a 
mirror, it would appear that we had chosen (;, k)\ every subsequent move starting 
with (i, j ) would have a mirror image move starting from (i, k). 

Having chosen to use ( i , j) as the second edge at i, we delete the other edge at i, 
(. i , k), by Rule 3. See Figure 2.5. Each time we apply a rule, we need to check how 



a 



2.2 Hamilton Circuits 59 


other edges and vertices are affected. Does any vertex now have only two remaining 
edges at it, allowing Rule 1 to be applied? Or are two edges now required to be used 
at some vertex, allowing Rule 3 to be applied? Also, is there some edge that, if used, 
would complete a subcircuit, allowing Rule 2 to be used to delete it? 

Deleting ( i , k) reduces the degree of k to 2, and so Rule 1 requires that we use 
both remaining edges incident at k, (j, k ) and (h, k). Edge (J, k ) is the second edge 
used that is incident to j. Then by Rule 3, j’s other edge (/, j) must be deleted. This 
deletion reduces the degree of / to 2. See Figure 2.5. So we must use the two remaining 
edges at/, ( b , /) and (/, e ). Edge ( b , /) is the second edge used at b, and so edge 
(. b , d ) at b must be deleted. Similarly, edge (/, e) is the second edge used at e, and so 
e’s other edges, (e, d) and (e, h ), must be deleted. (Note that after having determined 
the two edges at j and k, we have the path e-g-i-j-k-h, and the edge e-h must again 
be deleted by Rule 2 to avoid a subcircuit.) 

Deleting ( e , h ) forces the use at h of (c, h). Using (c, h ) forces the deletion at c of 
(c, d). However, now we have deleted all the edges incident to d from consideration on 
a Hamilton circuit. This contradiction implies that G cannot have a Hamilton circuit. 

Note that this graph does have Hamilton paths — for example, a-b-f-e-g-i-j-k- 
h-c-d. m 


It should be emphasized that the following is not a useful line of reasoning to 
show that a graph has no Hamilton circuit: start from some vertex and construct a 
route visiting successive vertices and show that all attempts to find a circuit through 
all vertices fail. Even in a simple graph like the one in Figure 2.5, there are hundreds of 
possible beginning subpaths for a Hamilton circuit that would all have to be checked. 
The approach presented here requires much less work than a rigorous trial-and-error 
effort. 


Example 3: Nonexistence of Hamilton Circuit III 

Show that the graph in Figure 2.6 has no Hamilton circuit. 

Note that this graph has vertical and horizontal symmetry (although vertex n is 
off to one side, its adjacencies have a square-like symmetry). It sometimes takes a 



Figure 2.6 
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bit of trial-and-error experimenting to find a good vertex at which to start trying to 
build a Hamilton circuit when there are no vertices of degree 2. We seek a vertex with 
the property that once two edges are chosen at the vertex, then the use of Rules 1 
and 3 will force the successive deletion and inclusion of many edges. Vertex e is such 
a vertex. We can either use two edges incident at e from opposite sides (180° apart) 
or use two edges incident at e that form a 90° angle. We must examine both cases 
to show that no Hamilton circuit can exist. (Hamilton circuits frequently have many 
such subcases that must all be checked out.) 


Case I Suppose we use two edges incident at e from opposite sides. By symmetry, 
they can either be edges from d and /or from b and h. Suppose we choose (d, e ) and 
( e , /). Then by Rule 3, we can delete edges (e, b ) and (e, h). Then at b and at h we 
must use both remaining edges, getting subpaths a-b-c and g-h-i. Now at d we can 
use either edge (d, a) or edge id, g). The two cases are symmetrical with respect to 
the edges chosen for the circuit thus far. So, without loss of generality, we can choose 
edge ( d , a). 

At f we cannot use (/, c) or else subcircuit a-b-c-f-e-d-a results. So we must use 
(/, i). See Figure 2.6. Since we have used two edges at vertices a and i, the other edges 
at these vertices can be deleted by Rule 3. We now obtain several inconsistencies. 
Vertices j, k, l, and m each now have degree 2. But each is incident to n, and so using 
the two remaining edges at each causes four edges to be used at n. We conclude that 
there is no Hamilton circuit in Case I. 


Case II Suppose we use two edges incident at e that form a 90° angle. By symme- 
try it does not matter which of the four 90° angle pairs of edges we choose. Suppose 
we choose (b, e) and id, e). See Figure 2.7. Then by Rule 3 we can delete edges 
(e, /) and (e, h). Then at /and h we must use both remaining edges getting sub- 
paths c-f-i and g-h-i. See Figure 2.7. By Rule 3 at i, we can delete edges (/ k ) and 
(i, /). By Rule 1, we must use the remaining edges at k and /, forming the subcircuit 
i-f-c-k-n-l- g-h-i. 

Having obtained contradictions in both cases, we have proved that the graph has 
no Hamilton circuit. ■ 



Figure 2.7 
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We now present a few of the theoretical results about the existence of Hamilton 
circuits and paths. (See [4] for proofs of Theorems 1, 2, and 3.) 

Theorem 1 (Dirac, 1952) 

A graph with n vertices, n > 2, has a Hamilton circuit if the degree of each vertex is 
at least n/2. 

Theorem 2 (Chvatal, 1972) 

Let G be a connected graph with n vertices, and let the vertices be indexed 
X\,X 2 , , x n , so that deg(x,) < deg(x, + i). If for each k < n/2 either degfe) > k 
or deg(x„_,t) >n—k, then G has a Hamilton circuit. 

Theorem 3 (Grinberg, 1968) 

Suppose a planar graph G has a Hamilton circuit H. Let G be drawn with any planar 
depiction, and let r, denote the number of regions inside the Hamilton circuit bounded 
by i edges in this depiction. Let r' be the number of regions outside the circuit bounded 
by i edges. Then the numbers r, and r[ satisfy the equation 



(*) 


Just as the inequality e < 3v — 6 for planar graphs in the corollary in Section 1 .4 
could be used to prove that some graphs are not planar. Theorem 3 can be used to 
show that some planar graphs cannot have Hamilton circuits. 

Example 4: Application of Theorem 3 

Show that the planar graph in Figure 2.8 has no Hamilton circuit. 

We have indicated that the number of bounding edges inside each of the regions in 
the planar depiction of the graph in Figure 2.8. There are three regions with four edges 
and six regions with six edges. Thus, no matter where a Hamilton circuit is drawn 
(if it exists), we know that o + r' 4 — 3 and r (l + r' (> = 6. Observe that for this graph, 
(*) reduces to 


2 ( 0 -r;) + 4(r 6 — r^) = 0 



Figure 2.8 
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We cannot have r$ — T 6 = 0, that is, r () = r' () = 3, for then (*) would require 
r\ — h 4 = 0 or r\ — r ' 4 — which is impossible, since r$ + r 4 = 3. If r^ — r^ ^ 0, then 
l r 6 — r 'f} > 2, and so \4(r^ — r^)| > 8. Now it is impossible to satisfy (*), since even if 
r\ — 3, Tl 4 — 0 (or r 4 = 0,r l 4 — 3), |2(r4 — r' 4 )\ < 6. Thus, it is impossible for (*) to be 
valid for this graph, and so no Hamilton circuit can exist. ■ 


We next present a theorem involving directed graphs and Hamilton paths. 
A tournament is a directed graph obtained from a complete (undirected) graph 
by giving a direction to each edge. 

Theorem 4 

Every tournament has a Hamilton path. 

Proof 

The proof is by induction. For a 2- vertex tournament, a directed Hamilton path trivially 
exists. Next assume by induction that any tournament with n— 1 vertices, for n > 3, 
has a directed Hamilton path and let us prove that a n-vertex tournament G has a 
directed Hamilton path. 

Remove a vertex z from G, leaving a tournament G' with n — 1 vertices. By 
the induction assumption, G' has a Hamilton path H = X1-X1-X3— — x n _ 1 . If the 
edge between z and x\ is ("7xi), then z can be added to the front of H to ob- 
tain a Hamilton path for G. Similarly, if the edge between z and x„_i is (x„_i7 z), 
then z can be added to the end of H to obtain a Hamilton path for G. So assume 
that the edge from the first vertex x\ of H points toward z and that the edge from the 
last vertex x„_i of H points from z. Then for some consecutive pair on H, x,_i , x,-, 
the edge direction must change — that is, we have edges (x,_ f, z.) and (zf x, ). We can 
insert z between x,_i and x,- in H to obtain a Hamilton path X 1 -X 2 — — x,-_i-z— x,— 
x„- 1 . ♦ 

We conclude this section with an application of Hamilton paths to a problem in 
coding theory. 


Example 5: Gray Code 

When a spacecraft is sent to distant planets and transmits pictures back to earth, 
these pictures are transmitted as a long sequence of numbers, each number being a 
darkness value for one of the dots in the picture. For simplicity, assume the darkness 
numbers range between 1 and 8. These numbers are actually sent as a sequence 
of Os and Is. A straightforward encoding scheme would be to express each number 
in its binary representation, that is, 1 as 001, 2 as 010, 3 as 01 1, and so on, ending 
with 8 as 000. 

However, abetter scheme, called a Gray code, uses an encoding with the property 
that two consecutive numbers are encoded by binary sequences that are almost the 
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same, differing in just one position. For example, a fragment of a Gray code might 
be 4 as 010, 5 as Oil, 6 as 001. The advantage of such an encoding is that if an 
error from “cosmic static” causes one binary digit in a sequence to be misread at a 
receiving station on earth, then the mistaken sequence will often be interpreted as a 
darkness number that is almost the same as the true darkness number. For example, 
in the preceding fragment of a Gray code, if 01 1 (5) were transmitted and an error in 
the last position caused 010 (4) to be received, the resulting small change in darkness 
would not seriously affect the picture. (Of course, some errors will cause substantial 
inaccuracies.) 

With this background, we now translate the problem of finding a Gray code for 
the 8 darkness numbers into the problem of finding a Hamilton circuit in a graph. We 
define the graph as follows: Each vertex corresponds to a 3-digit binary sequence, 
and two vertices are adjacent if their binary sequences differ in just one place. The 
graph is shown in Figure 2.9 a. Observe that it is a cube (the binary sequences can be 
thought of as the coordinates of a cube drawn in three dimensions). A similar graph 
can be drawn for the 16 4-digit binary sequences, or for any given n, the 2" n-digit 
binary sequences. 

We claim that the order in which vertices (binary sequences) occur along a Hamil- 
ton path in this graph produces a Gray code. That is, 1 is encoded as the first binary 
sequence (vertex) in a given Hamilton path, 2 as the second binary sequence, and 
so on. This process yields a Gray code because consecutive vertices in the Hamil- 
ton path, which encode consecutive darkness numbers, will correspond to binary 
sequences that differ in just one position. Figure 2.9 b illustrates how a Hamilton path 
in the graph produces a Gray code. ■ 


The graph with one vertex for each n-digit binary sequence and an edge joining 
vertices that correspond to sequences that differ in just one position is called an 
n-dimensional cube, or hypercube. The graph in Figure 2.9 a is a 3-dimensional cube 
(or standard cube). An n-dimensional cube has 2" vertices, each of degree n. It has 
the property that the longest path between any two vertices has length n. 

A generalization of a Hamilton circuit is used to solve the Instant Insanity puzzle 
in the supplement to this chapter. 


Oil 111 



5 = 011 6=111 



(b) 


Figure 2.9 
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2.2 EXERCISES 


Summary of Exercises The first 10 exercises involve the exis- 
tence or nonexistence of Hamilton paths and circuits. Exercises 9 and 10 introduce 
other potential aids for proving nonexistence. Exercises 15—19 involve applications 
of Hamilton circuits. The last five exercises involve theory. 


1. (a) Draw a graph with a Hamilton circuit but no Euler cycle, 
(b) Draw a graph with an Euler cycle but no Hamilton circuit. 

2. Find a Hamilton circuit in each of the following graphs. 





3. Find a Hamilton circuit in the following graph. 


a 



4. In each of the following graphs, prove that no Hamilton circuit exists: 
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5. Find a Hamilton path in Figure 2.8 and show that no Hamilton circuit exists 
(using the reasoning in Examples 1, 2, 3). 

6. Show that there can be no Hamilton circuit in the following graph using both 
edges (a, /) and (c, /?). 


a 



7. Prove that the following graphs have no Hamilton circuits. 



d 
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8. Use Theorem 3 to show that the following planar graphs have no Hamilton circuit: 

(a) Exercise 4(a) (b) Exercise 4(b) (c) Exercise 4 (p) 

9. Recall from Example 1 in Section 1 . 1 that a graph is bipartite if the vertices can 
be divided into two sets; for convenience call them blue vertices and red vertices, 
such that every edge connects a blue and a red vertex. 

(a) Show that if a connected bipartite graph has a Hamilton circuit, then the 
numbers of red and blue vertices must be equal. Further, show that if a 
bipartite graph has an odd number of vertices, then it has no Hamilton circuit. 

(b) Show that if a connected bipartite graph has a Hamilton path, then the num- 
bers of reds and blues can differ by at most one. 

(c) Use part (a) to show that the following graphs have no Hamilton circuit: 

(i) Figure 2.8 (ii) Exercise 4 (in) (iii) Exercise 7(b) 

10 . Suppose a set I of k vertices in a graph G is an independent set — that is, no pair 
of vertices in I are adjacent. Then for each x in I, deg(x) — 2 of the edges incident 
to x will not be used in a Hamilton circuit. Summing over all vertices in /, we 
have e' = ^ rg/ (deg(x) — 2) = {^ ve/ (deg(x)} — 2k edges that cannot be used in 
a Hamilton circuit. 

(a) Let v and e be the numbers of vertices and edges in G, respectively. Show 
that if e — e' < v, then G can have no Hamilton circuit. 

(b) Why is the claim in part (a) valid only when /is a set of nonadjacent vertices? 

(c) With a suitably chosen set I, use part (a) to show that the following graphs 
have no Hamilton circuits: 

(i) Figure 2.6 (ii) Exercise 4 (p) (iii) Exercise 7(b) 

11. (a) Draw a 4-dimensional hypercube graph. 

(b) Use the graph in part (a) to find a Gray code for encoding the numbers 1 
through 16 as 4-bit binary sequences. 

12 . Let the distance between two vertices in a connected graph be defined as the 
number of edges in the shortest path connecting those two vertices. Then the 
diameter of a graph is defined to be greatest distance between any two vertices 
in the graph. Show that a 4-dimensional hypercube has diameter 4. In general, 
show that a /-dimensional hypercube has diameter k. Note: No graph with 2 k 
vertices, all of degree k, has a smaller diameter than k. This minimum is achieved 
by a /-dimensional hypercube. 

13 . Find a connected, cubic graph (all vertices have degree 3) with no Hamilton 
circuit. 

14 . Show without citing any theorems stated in this section that any 6-vertex, undi- 
rected graph with all vertices of degree 3 has a Hamilton circuit. 

15 . Find a path of knight’s moves visiting all squares exactly once on an 8 x 8 
chessboard. (Hint: Very tricky — get help on the Internet.) 
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16. Suppose a classroom has 25 students seated in desks in a square 5x5 array. The 
teacher wants to alter the seating by having every student move to an adjacent 
seat (just ahead, just behind, on the left, or on the right). Show that such a move 
is impossible. 

17. (a) Describe how to construct a circuit including all squares of an n x n chess- 

board, n even, using a rook. Then do the same using a king. 

(b) Repeat part (a) for n odd. 

18. Consider 27 little cubes arranged in a 3 x 3 x 3 array (as in Rubik’s Cube). 
Form an associated graph with 27 vertices, one for each little cube, and with two 
vertices adjacent if they have touching faces (not just edges). Does this graph 
have a Hamilton path starting at the vertex corresponding to the middle inside 
cube and ending at one of the vertices corresponding to a corner cube? 

19. (a) How many different Hamilton circuits are there in K n , a complete graph on 

n vertices? 

(b) Show that K n , n prime > 3, can have its edges partitioned into \(n — 1) 
disjoint Hamilton circuits. 

(c) If 17 professors dine together at a circular table during a conference, and 
if each night each professor sits next to a pair of different professors, how 
many days can the conference last? 

20. (a) If a graph G has an Euler cycle, show that L(G), the line graph of G (see Exer- 

cise 1 6 of Section 2. 1 for the definition of a line graph), has a Hamilton circuit. 

(b) If G has a Hamilton circuit, show that L( G ) has a Hamilton circuit. 

(c) Show that the converses of parts (a) and (b) are false by finding counter- 
examples. 

21. Show that if G is not a complete graph, then it is possible to direct the edges of 
G so that there is no directed Hamilton path. 

22. Show that in a tournament (defined preceding Theorem 4) it is always possible 
to rank the contestants so that the person ranked ;th beats the person ranked 
(; + l)st. (Hint: Use Theorem 4.) 

23. Show that Theorem 1 is false if the requirement of degree >\n is relaxed to 
just >\(n - 1). 

24. (a) Prove for n > 3 that an undirected graph with n vertices and at least ( " , 1 ) + 2 

edges must have a Hamilton circuit. 

(b) Show that part (a) is false if there are only ("“ 1 ) + 1 edges. 


2.3 GRAPH COLORING 

In Example 1 of Section 1 .4 we introduced the problem of map coloring — coloring 
the countries of a map so that two countries with a common border are assigned dif- 
ferent colors. The problem of showing that any map can be 4-colored tantalized 
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mathematicians for 100 years until a computer-assisted proof was obtained by 
Appel and Haken in 1976. More recently, graph coloring has been applied to a 
variety of problems in computer science, operations research, and experiment de- 
sign. Recall that coloring countries in a map is equivalent to coloring vertices, with 
adjacent vertices getting different colors, in the graph obtained by making a ver- 
tex for each country and an edge between vertices representing countries with a 
common border; see Example 1 in Section 1.4. In general, a coloring of a graph 
G assigns colors to the vertices of G so that adjacent vertices are given different 
colors. 

In this section we show how to determine the minimal number of colors required 
to color a given graph. This minimal number of colors is called the chromatic number 
of a graph. We also give some applications of graph coloring. In the next section we 
will present some theorems about graph coloring. In a coloring of a graph, the vertices 
that have a common color will be mutually nonadjacent (no pair is joined by an edge). 
In Example 5 of Section 1 . 1 we introduced the term independent set to refer to such 
a set of mutually nonadjacent vertices. We shall revisit that example later in this 
section. 

For graphs with 15 or fewer vertices, it is usually not difficult to guess a graph’s 
chromatic number. To verify rigorously that the chromatic number of a graph is a 
number k, we must also show that the graph cannot be properly colored with k — 1 
colors. Proving that a graph cannot be ( k — l)-colored is, like proving that a graph 
has no Hamilton circuit, an NP-complete problem (see Appendix A. 5). In this case, 
the goal is to show that any ( k — l)-coloring we might construct for the graph must 
force two adjacent vertices to have the same color. 


Example 1: Simple Graph Coloring 

Find the chromatic number of the graph in Figure 2.10. 

Looking at the inner square with crossing diagonals, we see that vertices a, c, e, 
g are mutually adjacent — that is, they form a complete subgraph. They each require a 
different color in a proper coloring: four colors in all. Once four colors are available, 
it is easy to properly color the remaining vertices b, d, f, h. Each of them is adjacent 
to only two other vertices, and so at most two out of the four colors need ever be 
avoided with these vertices. Let us use the numbers 1, 2, 3, 4 as the “names” of our 
colors. Then one possible 4-coloring of the graph is shown in Figure 2.10. 


b, 3 



Figure 2.10 
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Figure 2.11 

In this problem it is immediately clear that the graph cannot be 3-colored, since 
some adjacent pair of vertices in the complete subgraph formed by a, c, e, g would 
have the same color in a 3-coloring. So the chromatic number of this graph is 4. ■ 

Example 1 points out two important rules. First, a complete subgraph on k vertices 
requires k colors [cannot be ( k — l)-colored]. Second, when building a ^-coloring of 
some graph, we can ignore all vertices of degree <k (and their incident edges), since 
once the other vertices are colored, there will always be at least one color available 
(not used by any adjacent vertex) to properly color each such vertex. 

Example 2: Coloring a Wheel 

Find the chromatic number of the graph in Figure 2.11. 

We note that a graph of this form is called a wheel. The largest complete subgraph 
in this graph is a triangle. Let us try to build a 3-coloring of this graph with “colors” 
1, 2, and 3. We will start by coloring the vertices of a triangle. Suppose we choose 
the triangle a, b, f: Let a be 1 . h be 2, and/be 3 (the order of colors is arbitrary). 
See Figure 2.11. Since c is adjacent to vertices b and /of colors 2 and 3, respectively, 
c is forced to be color 1. Similarly, d is forced to be 2, and then e is forced to be 1. 
However, now the adjacent vertices a and e both have color 1 . Thus the graph cannot 
be 3-colored. On the other hand, using a fourth color for e yields a proper coloring. 
So the chromatic number of this graph is 4. ■ 

Observe that in Example 2 if vertex e were missing and d were adjacent to a 
instead, then three colors would work. In general, wheel graphs with an even number 
of “spokes” can be 3-colored, whereas wheels with an odd number of spokes require 
four colors. 

The key to the impossibility of finding a 3-coloring in Example 2 is the sequence 
c, d, e of forced vertex colors. In general, when attempting to build a ^-coloring 
graph, it is desirable to start by ^-coloring a complete subgraph of k vertices and then 
successively finding an uncolored vertex adjacent to previously colored vertices of 
k— 1 different colors, thereby forcing the color choice for this vertex. 

The following example involves a graph where vertex colors are not forced. 

Example 3: Unforced Coloring 

Find the chromatic number of the graph in Figure 2.12. 

The largest complete subgraph is again a triangle, and so we want to try building 
a 3-coloring. The only triangles whose coloring will force the color of another vertex 


a, 1 
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are (cl, e, f) and (e, /, g). Suppose we color d 1, e 2, and/3. Then g is forced to 
be 1. Now we are in trouble, since no more uncolored vertices are adjacent to two 
colors. We use the AC Principle in a multi-step argument based on the assumption 
that a 3-coloring exists. 

Observe that b and c are both adjacent to a vertex of color 1 and are adjacent 
to each other. Thus, one of b and c must be color 2 and the other color 3. By the 
symmetry of the graph, we can assume b is 2 and c 3, although it does not actually 
matter. What is important is that one of b, c is color 2 and the other color 3, for this 
forces the color of a to be 1. Similarly, h and i will be 2 and 3, collectively, forcing 
j to be 1. But the adjacent pair a, j are both 1. Thus the graph cannot be 3-colored. 
Making aorja fourth color yields a proper 4-coloring, and so the graph’s chromatic 
number is 4. ■ 


Let us now look at some applications of graph coloring. We start by repeating 
Example 5 of Section 1.1, which involved the problem of scheduling committee 
meeting times. 

Example 4: Committee Scheduling 

A state legislature has many committees that meet for one hour each week. One wants 
a schedule of committee meeting times that minimizes the total number of hours but 
such that two committees with overlapping membership do not meet at the same time. 
We show that this is a graph-coloring problem. 

The key information about the committees is which committees have overlapping 
membership. Let us create a graph with a vertex corresponding to each committee 
and with an edge joining two vertices if they represent committees with overlapping 
membership. For concreteness, suppose that the graph in Figure 2.13 represents the 
membership overlap of 10 legislative committees. We must schedule the vertices 
so that adjacent vertices (overlapping committees) get different meeting hours. A 



Figure 2.13 
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coloring of this graph performs exactly this type of “scheduling.” The colors will 
represent different meeting times. The graph in Figure 2.13 has a chromatic number 
of 4 (a minimal coloring is shown in Figure 2.13). Thus, four hours suffice to schedule 
committee meetings without conflict. ■ 

Example 5: Garbage Truck Scheduling 

We now consider a complex optimization problem in which graph coloring plays only 
a secondary role. There is a set of sites .S', that must be serviced (visited) k, times each 
week (1 < kj < 6). We seek a minimal set of day-long truck tours for a week such 
that site S, is visited on kj of the tours. In addition, we require that these tours can 
be partitioned among the six days of the week (Sunday is excluded) in a manner so 
that no site is visited twice on one day. This is an extremely difficult problem, which 
cannot be solved exactly. 

When a problem of this type (involving garbage collection) was analyzed for 
the New York City Department of Environmental Protection by the author and some 
colleagues, an algorithm was used that started with an inefficient set of tours and 
successively tried to improve the set of tours. Suppose we had a simplified situation 
where the week contained just three workdays, and our algorithm had generated the 
set of tours shown in Figure 2.14 a. Can these six tours be so partitioned among the 
three workdays so that no site is visited twice on the same day? Another way to state 
the constraint is: if two tours have a site in common, then the tours are assigned to 
different workdays. This is the type of constraint handled by a coloring model. 

Given a set of tours, we form an associated tour graph with one vertex for each 
tour and two vertices adjacent if they correspond to two tours that visit a common 
site. We assign colors, representing the different workdays, to the vertices (tours) of 
the tour graph. In the general six-day problem, partitioning the tours among the six 
days is equivalent to 6-coloring the tour graph. In the tour graph in Figure 2.14 b, we 
desire just a 3-coloring, which indeed exists. If the optimizing algorithm were next to 
combine tours A and F in Figure 2. 14a to get a smaller set of tours, such a move would 
have to be blocked (and other optimizing moves tried instead) because the resulting 
tour graph would have a complete graph on the four vertices A, B, C, D\ this would 
require four colors (days). ■ 




Figure 2.14 
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We close this section with an example about edge coloring. An edge coloring 
assigns a color to each edge so that two edges with a common end vertex have different 
colors — that is, the edges incident at a vertex all have different colors. 

Example 6: Scheduling Round-Robin Tournaments 

In a round-robin tournament, each of the n contestants plays every other contestant 
once. A schedule for a round-robin tournament specifies which games are played each 
day. We assume that no contestant plays more than one game a day. 

(a) Model the problem of scheduling games in a round-robin tournament as an edge 
coloring problem. 

(b) Find a minimal (fewest days) schedule for a tournament with four contestants. 

(c) Find a minimal (fewest days) schedule for a tournament with five contestants. 

(a) We make a graph with a vertex for each contestant and an edge to represent 
a game between pairs of contestants. The graph will be a complete graph K n on n 
vertices (with an edge between every pair of vertices). Let the days in a tournament be 
the colors. Then assigning a day to each game in the tournament so that no contestant 
plays two games on the same day is equivalent to assigning colors to the edges of K n 
so that no two edges incident to a vertex have the same color. 

(b) We need to find a minimal edge coloring of K^. Name the vertices (contestants) 
a, b,c, d. Consider the edges (games) for a. Let a play b on day 1, a play c on day 2, 
and a play d on day 3. Then on each day, let the two contestants not playing a play 
each other. In terms of edges, (a, b) and (c, d) have color 1, (a, c) and (b. d) have 
color 2, and (a, d) and ( b , c) have color 3. 

(c) We need to find a minimal edge coloring of Ks . Name the vertices (contestants) 
a. b, c, d, e. Note that at most two games can take place on one day. In terms of the 
edge coloring, at most two edges can have a common color. Since Ks has 10 edges 
(see Figure 2.15), we will need at least five colors for the edge coloring. It is a bit 
tricky to find an edge 5-coloring of K$. We use the following geometric strategy. Let 
each edge on the outer pentagon of Ks, as drawn in Figure 2.15, be given a different 
color. In Figure 2.15, we give (a, b ) color 1, ( b , c) color 2, (c, d) color 3, id, e) color 
4, and (e. a) color 5. Next we give each of the 5-chord edges in the inner star the same 
color as the outside edge that is parallel to it. For example, chord (a, c) is parallel to 
outer edge (d, e ), and so (a, c) is given color 4. Since parallel edges obviously cannot 
have a common end vertex, this is a valid edge coloring. ■ 



Figure 2.15 
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2.3 EXERCISES 


Summary of Exercises Exercises 1-6 involve finding minimal 
vertex colorings and associated problems. Exercises 9-1 1 require minimal colorings 
of maps and a geometric array. Exercises 12-17 are color-modeling problems. 


1. Find the chromatic number of each of the following graphs. Give a careful argu- 
ment to show that fewer colors will not suffice. 
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(n) 


(o) 


b c 



b g 



(r) 


a 





2. Find a minimal edge coloring of the following graphs (color edges so that edges 
with a common end vertex receive different colors). 



3. (a) Draw a graph with seven vertices that is 3-chromatic, planar, and without an 

Euler cycle. 

(b) Repeat part (a), but now make the graph non-planar. 

4. (a) Draw a graph with over eight vertices that is 3-chromatic, that is planar, that 

has a Hamilton circuit, and that has an Euler cycle. 

(b) Repeat part (b), but now make the graph 4-chromatic. 

5. A graph G is color critical if the removal of any vertex of G decreases the 
chromatic number. Which graphs in Exercise 1 are color critical? 

6. Find all sets of three or more vertices that could have the same color in a proper 
coloring of the graph in 

(a) Exercise 1(a) (b) Exercise 1(b) 

7. A coloring partitions a graph G into sets of mutually nonadjacent vertices. In 
the complement G of G, this partition becomes a partition of G into sets of 
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mutually adjacent vertices — that is, complete subgraphs. Find such a minimal 
set of complete subgraphs partitioning the vertices in the graph in 

(a) Exercise 1(a) (b) Exercise 1(b) (c) Exercise l(m) 

8. (a) Extend the scheduling solution for the 5 -person round-robin tournament 

found in Example 6 to a 6-person tournament. {Hint: The key is the per- 
son who does not play each day in the 5-person tournament.) 

(b) Copy the geometric approach for solving the 5-person tournament in Exam- 
ple 6 to find minimum schedules for a 7-person and a 9-person tournament. 

(c) Building on your results in parts (a) and (b), propose a general formula for 
the minimum number of days it takes it takes to play an n -person round-robin 
tournament. Note that the answer will depend on whether n is odd or even. 

9. Can the 50 states in a map of the United States be properly 3-colored? (Note that 
states meeting only at a corner, such as Colorado and Arizona, are not considered 
adjacent.) 

10. Suppose a map is made by drawing n intersecting circles. Show that the regions 
in this map can be properly 2-colored. 

(a) Solve by an inductive argument. 

(b) Solve by assigning colors based on the number of circles that contain a region. 

11. How many colors are needed to color the 15 billiard balls in this triangular array 
so that touching balls are different colors? 



12. A zoo is going to place its animals in a set of large open areas, instead of having 
them in individual cages. If two different animals cannot live together peacefully 
(e.g., a tiger and deer cannot live together because the tiger will eat the deer), then 
they must be put in different open areas. The zoo wants to determine the minimum 
number of open areas needed to safely house all its animals. Model this problem 
of assigning animals to a minimal number of open areas as a graph-coloring 
problem. What are the vertices, what are the edges, what are the colors? 

13. The Applied Math Department is scheduling the times for classes for next 
semester. Each student has already decided which subset of ApMath classes 
he or she wants to take. Classes must be scheduled so that every student can 
have the courses requested without conflict. Model this scheduling problem as a 
graph-coloring problem. What are the vertices, what are the edges, what are the 
colors? 

14. (a) A set of solar experiments is to be made at observatories. Each experiment 

begins on a given day of the year and ends on a given day (each experiment is 


2.4 Coloring Theorems 11 


repeated for several years). An observatory can perform only one experiment 
at a time. The problem is: what is the minimum number of observatories re- 
quired to perform a given set of experiments annually? Model this scheduling 
problem as a graph-coloring problem. 

(b) Suppose experiment A runs from Sept. 2 to Jan. 3, experiment B from Oct. 
15 to March 10, experiment C from Nov. 20 to Feb. 17, experiment D from 
Jan. 23 to May 30, experiment E from April 4 to July 28, experiment F from 
April 30 to July 28, and experiment G from June 24 to Sep. 30. Draw the 
associated graph and find a minimal coloring (show that fewer colors will 
not suffice). 

15 . There are 12 cruise ships scheduled to be in a port for various days during a 
given week. The ships must be assigned to one of five different piers (only one 
ship at a time can be docked at a pier). The question is whether an assignment 
of ships to piers is possible, given ships’ visiting plans. Describe how to make 
a graph-coloring model of this assignment problem: What are the vertices, what 
are the edges, what are the colors? 

16 . A banquet center has eight different special rooms. Each banquet requires some 
subset of these eight rooms. Suppose that there are 12 evening banquets that we 
wish to schedule in a given week (seven days). Two banquets that are scheduled 
on the same evening must use different special rooms. Model and restate this 
scheduling problem as a graph-coloring problem. 

17 . Which of the following pairs of tours in Figure 2.14a can be combined without 
violating the 3-colorability requirement of the tour graph in Example 6? 

(a) Tours D and E (b) Tours C and D 

18 . Consider a graph representing games played between a set of football teams with 
a directed edge from vertex A to vertex B if team A beats team B. Suppose that 
it is known that this graph has no directed circuits (i.e., no situation such as A 
beats B, B beats C, and C beats A). We can define a set of levels in this football 
graph as follows: A vertex with no outward edges (the team beat no other team) 
is at level 0; if all a vertex’s outward edges go to level-0 vertices (no-win teams), 
then the vertex is at level 1 ; and in general, a vertex is at level k if the greatest 
level of a team it beat is level k — 1 . Show that the level number of each vertex 
is a proper “coloring” of the vertices. 



2.4 COLORING THEOREMS 

In the previous section we studied strategies for finding a minimal coloring of a graph 
and gave some applications that could be modeled as graph-coloring problems. In 
this section we present some theorems about graph coloring. 
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We begin with a theorem about coloring the corners of a polygon and use it to 
obtain a simple solution to an interesting problem in computational geometry. We 
treat a polygon as a plane graph consisting of a single circuit with edges drawn as 
straight lines. The polygon need not be a convex figure. See the sample polygon in 
Figure 2.16 a. By a triangulation of a polygon, we mean the process of adding a set of 
straight-line chords between pairs of vertices of a polygon so that all interior regions 
of the graph are bounded by a triangle (these chords cannot cross each other nor can 
they cross the sides of the polygon). Figure 2.16/? shows one possible triangulation 
of the polygon in Figure 2.16 a. 

Theorem 1 

The vertices in a triangulation of a polygon can be 3-colored. 

Proof 

Our proof is by induction on n, the number of edges of the polygon. For n = 3, give 
each corner a different color. Assume that any triangulated polygon with less than n 
boundary edges, n > 4, can be 3-colored and consider a triangulated polygon T with 
n boundary edges. 

Pick a chord edge e, as illustrated by chord (g, k ) in Figure 2. 17a. We note that T 
must have at least one chord edge, since n > 4. This chord e splits T into two smaller 
triangulated polygons, as shown in Figure 2.11b, each of which can be 3-colored by 
the induction assumption. The 3-colorings of the two subgraphs can be combined to 
yield a 3-coloring of the original triangulated polygon by picking the names for the 


j 
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Figure 2.17 
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colors in the two subgraphs so that the end vertices of chord e have the same colors in 
each subgraph. In Figure 2.17 b, this would mean making the color of k be the same 
in the two subgraphs and making the color of g be the same. ♦ 

In practice, it is easy to produce a 3-coloring of a triangulation of a polygon. 
Moreover, this 3-coloring is unique. For details, see Exercise 7. We now present an 
interesting application of Theorem 1 to a problem that seems to have nothing to do 
with coloring. 

The Art Gallery Problem asks the smallest number of guards needed to watch 
paintings along the n walls of an art gallery. The walls are assumed to form a polygon. 
The guards need to have a direct line of sight to every point on the walls. A guard 
at a corner is assumed to be able to see the two walls that end at that corner. The 
expression [/J denotes the largest integer < r. For more information about the Art 
Gallery Problem, see O’Rourke [5]. 

Corollary (Fisk, 1978) 

The Art Gallery Problem with n walls requires at most [n /3J guards. 

Proof 

Make a triangulation of the polygon formed by the walls of the art gallery. Observe 
that a guard at any corner of a triangle has all sides of the triangle under surveillance. 
Now obtain a 3-coloring of this triangulation. Note that each triangle will have one 
corner of each color. Take one of the colors, say “red,” and place a guard at every 
corner colored “red.” This places a guard at a corner on every triangle. Hence the sides 
of all triangles, and, in particular all the gallery walls, will be watched. A polygon 
with n walls has n corners. If there are n corners and three colors, some color is used 
at [ n /3J or fewer corners. ♦ 

This bound is the best possible. For example, Figure 2.18 gives an example of a 
polygon with 12 corners that requires four guards. 

We now present three representative coloring theorems. We will not prove the 
first two. For proofs of Theorems 2 and 3 see [4]. Note that Theorem 2 in Section 
1 .3 was a coloring theorem. As stated there, it said that a connected graph is bipartite 
if and only if all circuits have even length. Observe that being bipartite is the same 
as being 2-colorable. Thus, a graph is 2-colorable if and only if all circuits have 
even length. (We can drop the condition that the graph be connected, because if each 
component is 2-colorable, then the whole graph is 2-colorable.) Let y(G) denote the 
chromatic number of the graph G. 



Figure 2.18 
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Theorem 2 (Brooks, 1941) 

If the graph G is not an odd circuit or a complete graph, then / (G) < d, where d is 
the maximum degree of a vertex of G. 

The maximum degree is for most graphs a poor upper bound on /(G). The 
examples in Section 2.3 had /(G) closely related to the size of the largest complete 
subgraph. Thus, it seems natural that there should be a good bound on /(G) in terms 
of the size of the largest complete subgraph. However, for any positive integer k, there 
exists a triangle-free graph G with /(G) = k: see Exercise 18 for details. 

Instead of coloring vertices, we can color edges so that edges with a common 
end vertex get different colors. A very good bound on the edge chromatic number of 
a graph in terms of degree is possible. All edges incident at a given vertex must have 
different colors, and so the maximum degree of a vertex in a graph is a lower bound 
on the edge chromatic number. Even better, one can prove the following: 

Theorem 3 (Yizing, 1964) 

If the maximum degree of a vertex in a graph G is d, then the edge chromatic number 
of G is either d or d + 1 ■ 

Finally we would like to present and prove a theorem about coloring planar 
graphs. As noted earlier, in 1976 Appel and Haken [1] proved that all planar graphs 
can be 4-colored. But their proof is incredibly long and requires computer analysis 
involving 1,955 cases, each of which in turn involves many pages of analysis. We will 
state and prove an easier “second best” theorem. 

Theorem 4 

Every planar graph can be 5-colored. 

Proof 

We need to consider only connected planar graphs, since we can 5-color unconnected 
planar graphs by 5-coloring each connected component. A key step in this proof uses 
a fact about planar graphs proved in Exercise 1 8 in Section 1 .4: Any connected planar 
graph has a vertex of degree at most 5 . We prove this theorem by induction on the 
number of vertices. Trivially a 1 -vertex graph can be 5-colored. 

Next we assume that all connected planar graphs with n — 1 vertices (n > 2) 
can be 5-colored. We will prove that a connected planar graph G with n vertices 
can be 5-colored. As noted above, G has a vertex x of degree at most five. Delete x 
from G to obtain a graph with n — 1 vertices, which by assumption can be 5-colored. 
Now we reconnect x to the rest of the graph and try to properly color x. If x has 
degree <4, then we can simply assign x a color different from the colors of its 
neighbors. The same approach works if the degree of x is five but two neighbors have 
the same color. Thus, it remains to consider the case where x has five adjacent vertices 
each with a different color. See Figure 2.19, where we label the neighbors of x as 
a,b,c,d,e according to their clockwise order about x in some planar depiction of G. 
Let the colors be the numbers shown in Figure 2.19. 
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Figure 2.19 

First consider all paths starting from a whose vertices are colored 1 and 3. See 
Figure 2.19. Suppose there is no path consisting of 1 and 3 vertices from a to c. Then 
we can change the color of a from 1 to 3, change the neighbors of a colored 3 to 1, and 
so on along all paths of 1 and 3 vertices emanating from a. This 1-3 interchange will 
not affect c since there is no path of Is and 3s from a to c. After this 1-3 interchange 
from a, a and c are both color 3, and x can be properly colored with 1 . 

On the other hand, if there is a 1-3 path from a to c, then we consider all paths 
starting from b, whose vertices are colored 2 and 4. The 1-3 path from a to c, together 
with edges ( x , a), (x, c), form a circuit that blocks the possibility of any 2-4 path 
going from b to d. Thus, we can perform a 2-4 interchange along all paths of 2s and 
4s emanating from b without changing the color of d. After this 2-4 interchange, b 
and d are both color 4, and x can be properly colored with 2. 

This completes the induction step of the proof that any /7-vertex connected planar 
graph can be 5-colored. ♦ 

We close this section with a brief introduction to chromatic polynomials. The 
chromatic polynomial Pk(G) of a graph G gives a formula for the number of ways 
to properly color G with k colors. The formula is a polynomial in k. If k is so small 
that G cannot be colored with only k colors, then Pk(G) will have to equal 0 for that 
value of k. 

Example 1: Chromatic Polynomial 

What is the chromatic polynomial of 

(a) A path P 6 = a-b-c-d-e - /? 

(b) A complete graph K$ on five vertices (all vertices adjacent to each other)? 

(c) The graph C 4 of a circuit of length 4? 

(a) There are k color choices for a. There are k — 1 color choices for b (any 
color but the one used for a). Similarly, there are k — 1 color choices for successively 
coloring c, then d, then e, and /. So Pk(Pe) = k(k — l) 5 . 

(b) In a complete graph, each vertex must be a different color. Thus, Pk(Ks) = 
k(k — 1 )(k — 2 ){k — 3 ){k — 4), since there are k possible choices for the first vertex 
to be colored; then that color cannot be used again, and so the second vertex to be 
colored has k — 1 choices, and so on. 
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(c) Let the circuit C 4 be a-b-c-d-a. We break the computation of Pk(C 4 ) into 
two cases, depending on whether or not a and c are given the same color. 

If a and c have the same color, there are k choices for the color of these two 
vertices. Then b and d each must only avoid the common color of a and c — k — 1 
color choices each. So the number of A:-colorings of C 4 in this case is k(k — l) 2 . 

If a and c have different colors, there are k(k — 1) choices for the two different 
colors for a and then c. Now b and d each have k — 2 color choices. So in this case 
the total number of fc-colorings of C 4 is k(k — 1 )(k — 2) 1 . Combining the two cases, 
we obtain P/fCf) — k{k — l ) 2 + k{k — 1)(£ — 2) 2 . ♦ 

The results in parts (a) and (b) of Example 1 can be generalized. Part (a) can be 
extended to any connected graph with no circuits. Such graphs are called trees. Trees 
are the subject of the next chapter. 

Theorem 5 

(a) If T is a tree with n vertices, then Pi ( T ) — k(k — l)" -1 . 

(b) If K„ is a complete graph in n vertices, then Pi- ( K n ) = k{k — \ )(k — 2) . . . (k — 
n + 1). 

Proof 

(a) Choose some vertex in T ; call it a. There are k color choices for a. Let Si be the 
vertices adjacent to a. Note that there is no edge between two vertices x, y in Si, 
since then a, x, y would form a triangle, that is, a circuit of length 3. Then each 
vertex in Si can be any of the color except the color of a : k — 1 choices for each 
vertex. Now let S 2 be vertices at the end of paths of length 2 (two edges) from a. To 
avoid forming a circuit, no two vertices in Si can be adjacent and each is adjacent 
to only one vertex in Si. So each vertex in S 2 again has k — 1 color choices (any 
color except that of the unique adjacent Si -vertex). This same argument applies 
for S 3 , S 4 , etc. Thus, except for a, each vertex has k — I color choices. 

(b) The n vertices in K„ must each be a different color. The result follows from the 
same reasoning as used in Example 1(b). ♦ 

Note that if a graph is not connected, its chromatic polynomial is the product of 
the chromatic polynomials of each connected component. So we only need to worry 
about determining chromatic polynomials for connected graphs. 

Theorem 5 is a starting point for finding the chromatic polynomial for an arbitrary 
connected graph. The following decomposition theorem shows how to repeatedly 
reduce the chromatic polynomial of a given graph into the sum of the chromatic 
polynomials of two simpler graphs with fewer edges missing. The reduction eventually 
leads to complete graphs. The following corollary gives a decomposition that yields 
simpler graphs with fewer edges, eventually leading to trees. 

Theorem 6 

Let x, y be two non-adjacent vertices in the graph G. Let G + ( Xty ) be the graph 
obtained from G by adding the edge {x, y). Let G x=y be the graph obtained from G 
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by coalescing x and y into a single vertex xy that is adjacent to any vertex that was 
adjacent to x or y in G. Then 

P k (G) = P k (G +ix , y) ) + P k (G x=y ) (1) 


Proof 

The proof of (1) is based on the simple fact that all /-colorings of the vertices of G 
can be broken into two disjoint cases: (a) x and y are assigned different colors and (b) 
x and y are assigned the same color. The number of /-colorings in case (a) equals the 
number of /-colorings of G + ( Xty) , since adding the edge (x, y) forces x and y to have 
different colors. Similarly, the number of /.'-colorings in case (b) equals the number 
of /'-colorings of G x=y . ♦ 

Corollary 

Let x, y be two adjacent vertices in the graph G. Let G_ (xy) be the graph obtained 
from G by deleting the edge (x, y). Let G x=y be the graph obtained from G by 
coalescing x and y, as in Theorem 6. 

P k (G) = P k (G- (x , y) ) - P k (G x=y ) (2) 


Proof 

Let H be the graph G-( x , y ). Vertices x, y are now nonadjacent in H. Observe, con- 
versely, that H + ( x y> = G. Also, H x=y = G x=y . Applying identity (1) to H and non- 
adjacent vertices x, y yields 

Pk(H) = Pk(H Mx , y) ) + P k (H x=y ) (3) 

Recasting (3) in terms of G, and G x=y , we have 

P k (G_ (*,,)) = P k (G) + P k (G x=y ) (4) 

Solving for P k (G ) in (4), yields (2). ♦ 

We now rework part (c) of Example 1 using Theorem 6. 

Example 1(c) (repeated): Applying Theorem 6 

What is the chromatic polynomial of the 4-circuit C4 = a-b-c-d-al 

We determine the chromatic polynomial of C4 first using identity (1) and then a 
second time using identity (2). We apply (1) to C4 with nonadjacent vertices a, c. The 
graph (C4) + ( fljC ) consists of two triangles, (a, b, c ) and (a, c, d ) with the common edge 
(a, c). The triangle (a, b, c ), a complete graph, can be /-colored in k{k — I )(k — 2) 
ways by Theorem 5(b). Next there are k — 2 choices for d (avoiding the colors of a and 
c). So P k ((C 4 )+ (a ,c)) = k(k — 1 )(/ — 2) 2 . The graph (C4) fl=c is the 3-vertex path b-ac-d 
(where ac is the vertex formed by coalescing a and c). So P k {{Cf) a=c ) — k(k — l) 2 , 
by Theorem 5(a). Summing these two chromatic polynomials, we have PkiCf) = k 
{k _ !)[(/_ i ) + {k - 2 ) 2 ]. 
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Next we apply identity (2) to C 4 with adjacent vertices a, b. The graph (C 4 )_( ai /,) is 
the 4- vertex path b-c-d-a with chromatic polynomial k(k — l) 3 4 5 6 7 8 , by Theorem 5(a). The 
graph (C 4 ) a= b is the triangle ( ab , c, d) with chromatic polynomial kik — I )(k — 2). 
Then, by (2), 


Pjfc(C 4 ) = Pk((C4)-{a,b)) — Pk((C4) a= b ) 

= kik - l) 3 - kik - 1 )ik - 2) = kik - l)[(k - l) * 1 2 - ik - 2)] ♦ 


Note that the answer using identity (1) has exactly the same form as the answer 
for part (c) when we originally worked Example 1 using ad hoc methods. The answer 
we got using identity (2) looks a little different but is algebraically equivalent. 


2.4 EXERCISES 



' Summary of Exercises Most of the exercises are proofs of results 
in coloring theory. 

1. Use the fact that every planar graph has a vertex of degree <5 to give a simple 
induction proof that every planar graph can be 6-colored. Follow the argument 
in the beginning of the proof of Theorem 5. 

2. Show that a planar graph G with eight vertices and 13 edges cannot be 2-colored. 
iHint: Use results in Section 1.4 to show that G must contain a triangle.) 

3. Suppose a graph G has k connected components, labeled G\, G 2 , . . . , Gk- Let 
X(Gj) denote the chromatic number of the ith component. Give a formula for 
the chromatic number / (G) of G in terms of the x(G,)’s. 

4. Let G be a graph with every vertex of degree at least 3. Prove or give a counterex- 
ample to the assertion that at least three colors are needed to color the vertices 


of G. 


5. For which graphs are the sizes of the minimal vertex and minimal edge colorings 
both equal to 2? 

6 . Prove by induction that the graph of any triangulation of a polygon will have 
at least two vertices of degree 2. (Hint: Split the triangulation graph into two 
triangulation graphs at some chord e.) 

7. (a) Using the type of reasoning in Section 2.3, explain how to 3-color any trian- 

gulation of a polygon. 

(b) Use the argument in part (a) to show that the 3 -coloring of any triangulation 
of a polygon is unique, except for changing the names of the colors. 

8. (a) If q is the size of the largest independent set in a graph G, show that 

X( G)q > n, where n is the number of vertices in G. 

(b) If the minimum degree of a vertex is d in an n-vertex graph G, then use the 
result in part (a) to show that /(G)(n — d)>n, and hence / (G) > n/in — d). 
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9. A graph is color critical if the removal of any vertex decreases the graph’s 
chromatic number. Show that every ^-chromatic color critical graph G has the 
following properties: 

(a) G is connected. 

(b) Every vertex of G has degree >k — 1 . 

(c) G has no vertex whose removal disconnects G. 

10. Show that G can be edge /.'-colored if and only if L(G), the line graph of G (see 
Exercise 16 in Section 2.1), can be vertex fc-colored. 

11. If G is the complement of G, then show that 

(a) y (G) + X ( G ) <n + 1 (Hint: Use induction.) 

(b) x(G)x(G) > n 

(c) x(G) + x(G)> 2^/n 

12. Show that if every region in a planar graph has an even number of bounding 
edges, then the vertices can be 2-colored. 

13. Show that no planar graph G has a chromatic polynomial of the form Pk(G) = 
( k 2 — 6 k + 8 )Q(k), where Q(k) is positive for k > 0. 

14. Determine the chromatic polynomial Pk(G ) for the following graphs: 



(d) (e) 


15. Show that a graph is /c-colorable if and only if its edges can be directed so that 
there is no directed circuit and its longest path has length k — 1. 

16. Use the fact that every planar graph with fewer than 12 vertices has a vertex of 
degree <4 (Exercise 19 in Section 1 .4) to prove that every planar graph with less 
than 12 vertices can be 4-colored. 

17. Show that a graph with at most two odd-length circuits can be 3-colored. 

18. Prove that for any positive integer k, there exists a triangle-free graph G with 
X(G) — k. 

(a) The proof should be by induction on k, the chromatic number. Initially for 
k — 3, we use the graph G 3 consisting of a 5-circuit. 

(b) Assuming one can construct Gk, a triangle-free graph with x. (G k) — k, one 
constructs Gk + 1 by making k copies of Gk and then adding (n^f vertices, 
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where % is the number of vertices in &V Each new vertex has as its set of 
neighbors a different k-tuple consisting of one vertex from each copy of G/ ( . 
Confirm that this new graph is the desired Gk + 1 ■ 




2.5 SUMMARY AND REFERENCES 


This chapter presented two important graph-theoretic concepts: covering cycles or 
circuits and coloring. Section 2. 1 discussed Euler cycles — cycles that traverse every 
edge exactly once. Section 2.2 discussed Hamilton circuits — circuits that visit every 
vertex exactly once. Both types of covering edge sets arise naturally in operations 
research routing problems. Despite the similarity in the definitions of Euler cycles and 
Hamilton circuits, determining the existence of Euler cycles and Hamilton circuits in a 
graph are as different as graph-theoretic problems can be. Euler’s theorem allows one 
to quickly decide whether an Euler cycle exists. On the other hand, except in special 
cases, the existence or nonexistence of a Hamilton circuit can be determined only by a 
laborious systematic search to try all possible ways of constructing a Hamilton circuit. 

Section 2.3 introduced graph coloring with ad hoc coloring schemes and some 
applications of coloring. Section 2.4 gave a sampling of coloring theory, highlighted 
by a proof of the fact that any planar graph can be 5-colored. The stronger theorem 
proved in 1976 by Appel and Haken [1], that planar graphs are 4-colorable, was the 
motivation of much of the research in graph theory over the previous 100 years. The 
search for a proof of the Four Color Theorem led to reformulations of this theorem 
in terms of Hamilton circuits and other graph concepts whose properties were then 
examined. 

Euler’s 1736 analysis of Euler cycles was the first paper on graph theory. Euler’s 
paper (translated in Biggs, Lloyd, and Wilson [3]) makes very interesting reading. It 
is instructive to see how awkward Euler’s writing was when he lacked the modern 
terminology of graph theory. The first use of the concept of a Hamilton circuit occurred 
in a 1771 paper by A. Vandermonde that presented a sequence of moves by which 
a knight could tour all positions of a chessboard (without repeating a position). The 
name “Hamilton” refers to W. Hamilton, whose algebraic research led him to consider 
special types of circuits and paths on the edges of a dodecahedron [the graph in 
Exercise 2(c) in Section 2.2], Hamilton even had a game marketed that involved 
finding a Hamilton circuit on a dodecahedron (Hamilton’s instructions for this game 
are reprinted in [3]). See Barnette [2] for a good history of the Four Color Problem, 
its restatements, and final solution by Appel and Haken. 

1. K. Appel and W. Haken, “Every planar map is 4-colorable,” Bull. Ain. Math. Soc. 
82(1976), 711-712. 

2. D. Barnette, Map Coloring and the Four Color Problem, Mathematical Associ- 
ation of America, Washington, DC, 1984. 
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3. N. Biggs, E. Lloyd, and R. Wilson, Graph Theory 1736-1936, Cambridge Uni- 
versity, Cambridge, 1999. 

4. J. Bondy and U. Murty, Graph Theory with Applications, American Elsevier, 
New York, 1976. 

5. J. O’Rourke, Art Gallery Theorems and Algorithms, Oxford University Press, 
New York, 1987. 


SUPPLEMENT: GRAPH MODEL FOR INSTANT INSANITY 


This supplement presents a clever solution to the Instant Insanity puzzle devised by Blanche 
Decartes (an alleged pseudonym for the famous graph theorist W. Tutte). The solution uses a 
generalized form of Hamilton circuit in which all vertices are covered by a collection of 
vertex-disjoint circuits rather than a single circuit. 

The Instant Insanity puzzle consists of four cubes whose faces are colored with 
one of the four colors: red (R), white (W), blue (B), and green (G). The six faces 
on the f'th cube are denoted: f — front face, /, — left face, b, — back face, r , — right 
face, ti — top face, and m, — under face. In Figure 2.20, the first cube is colored: 
/ ! = B,r\ — W, fi = R,b\ = R, t\ = G, ii\ = B. The objective of this puzzle is to 
place the four cubes in a pile (cube 1 on top of cube 2 on top of . . . etc.) so that 
each side of the pile has one face of each color. For example, Figure 2.21 shows one 
solution for the pile of cubes given in Figure 2.20. 

We shall work with the four cubes shown in Figure 2.20. Determining how 
to arrange these four cubes in a pile that is an Instant Insanity solution is a very 
difficult task. Observe that there are 24 symmetries of a cube, and thus 24 4 = 331,776 
different piles that can be built. An enumeration tree search will be immense, although 
symmetries of the face colors and the constraint of no repeated colors on a side will 
eliminate many possibilities. A computer program to do this search is easily written. 
Fortunately, we can model this puzzle with a 4-vertex graph in such a fashion that 
the graph-theoretic restatement of the puzzle can be solved by inspection in a few 
minutes. 

Before presenting the graph model, we need to discuss a simple decomposition 
principle for this puzzle. Arranging the cubes in a pile so that left and right sides 
of the pile have one face of each color is “independent” of arranging the front and 
back sides with one face of each color. By “independent” we mean that once cube 



Figure 2.20 


Cube 1 


Cube 2 


Cube 3 


Cube 4 
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i is arranged so that a given pair of opposite faces are on the left and right side of the 
pile, then any remaining pair of opposite faces on cube i can be on the front and back 
sides of the pile. 

Let If, rf, f * , hf denote the colors of the four respective faces of cube i visible 
on the four sides of the pile when cube i is reoriented to obtain an Instant Insanity 
solution. Suppose that If = t\=G and rf = u\ = B, that is, the top face t\ of cube 1 
in Figure 2.20, which is green, is reoriented to be the left side If, and the under face 
mi, which is blue, is reoriented to be the right side rf. Then by rotating cube 1 about 
the centers of these two faces, we can get ff = B, bf = W, or /* = R, hf = K, or 
/* = W,bf = B — all possible remaining choices for front and back sides. Since this 
same left-right and front-back “independence” holds for the other cubes, we see that 
the puzzle can be broken into two disjoint problems. 


Decomposition Principle 

1. Pick one pair of opposite faces on each cube for the left and right sides of the 
pile so that these two sides of the pile will have one face of each color. 

2. Pick a different pair of opposite faces on each cube for the front and back sides 
of the pile so that these two sides will have one face of each. 

Now we are ready to present our graph model (due to F. de Carteblanche). 
Actually we use a multigraph (in which multiple edges and loops are allowed). Make 
one vertex for each of the four colors. For each pair of opposite faces on cube i, 
create an edge with label i joining the two vertices representing the colors of these 
two opposite faces. For opposite faces l\ = B , r\ = W on cube 1, we draw an edge 
labeled 1 between vertex B and vertex VP; for f\ = R ,b\ = R, we draw a loop labeled 
1 at vertex R\ and for t\ = G, u\ — B, we draw an edge labeled 1 between vertices 
G and B. The edges for the other cubes are drawn similarly. Figure 2.22 shows this 
multigraph for the cubes in Figure 2.20. 
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Figure 2.22 



By the decomposition principle, we can break the puzzle into a left-right part 
and a front-back part. We initially consider just the left-right part — that is, find one 
pair of opposite faces on each cube so that the left and right sides of the pile have one 
face of each color. 

Let us simplify this left-right problem slightly by asking only for a set of four 
opposite-face pairs, one pair from each cube, such that among this total set of eight 
faces each color appears twice. Later we will show how to ensure that each color 
appears once on the left side and once on the right side. Since a color corresponds 
to a vertex, a cube to an edge number, and a pair of opposite faces to an edge, this 
simplified left-right problem has the following graph-theoretic restatement: find four 
edges, one with each number, such that the family of eight end vertices of these four 
edges contains each vertex twice. 

This condition on the end vertices of the four edges is equivalent to requiring 
that the subgraph formed by these four edges has each vertex of degree 2 (a self-loop 
counts as degree 2 at its vertex). Note that a subgraph with all vertices of degree 2 is 
just a circuit or collection of disjoint circuits (a self-loop is a circuit of length 1). In 
an n- vertex multigraph, a set of n edges forming disjoint simple circuits is called a 
factor. Observe that a factor is a natural generalization of a Hamilton circuit. In our 
Instant Insanity model, let the term labeled factor denote a factor in which each edge 
number appears once. In Figure 2.23 we show three possible labeled factors for the 
multigraph in Figure 2.22. 

The simplified left-right problem in our Instant Insanity graph model reduces to: 
find a labeled factor. 



Figure 2.23 
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Figure 2.24 



7 

C\\ 

/T" 

6 

7 

f<! 

i 


/: 

7 

'f'j 

/r" 

7 

i 

v'l 

1 

p 

7 


Cube 1 


Cube 2 


Cube 3 


Cube 4 


We next show how a labeled factor can be transformed into an arrangement of 
the cubes in which the left and right sides of the pile have one face of each color. 
We direct the edges in each circuit in a consistent direction — say, in the clockwise 
direction (see Figure 2.23). Consider the labeled factor in Figure 2.23 a. As we go 
around this circuit, we arrange each cube with the color of the tail vertex of an edge 
as the color on the left side of the cube, and the color of the head vertex on the 
right side. One can start with any edge on the circuit; we pick the edge labeled 1. 
This 1-edge in Figure 2.23a goes from B to G, and so we arrange cube 1 so that I* 
(the left face of cube 1 in the pile) = B and r* = G (see Figure 2.24). Following 
the 1-edge on the circuit (in the clockwise orientation of the circuit), we next en- 
counter a 4-edge from G to R. Accordingly we arrange cube 4 so that = G and 
r\ — R. Next comes a 2-edge from R to W followed by a 3-edge from W to B. So 
we arrange cubes 2 and 3 with l^ = R,r^ — W and /* = W. = B. Figure 2.24 
shows the left and right sides (only) of the pile made by the four cubes arranged 
as just described. 

This process assures that each color appears once on each side, since each vertex 
(color) is at the head of one edge and at the tail of one edge. If we had chosen the 
labeled factor in Figure 2.23 b, we would have used the clockwise traversal procedure 
for all three circuits, yielding /* = B , r* = G , l\ = G, r% = B for one circuit, and 
l* = = R and Z| = r* = W for the two self-loops. 



Figure 2.25 0 
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We are now ready to solve the Instant Insanity puzzle. Restating the decomposi- 
tion principle in terms of labeled factors: 


Graph-Theoretic Formulation of Instant Insanity 


1. Find two edge-disjoint labeled factors in the graph of the Instant Insanity 
puzzle, one for left-right sides and one for front-back sides. 

2. Use the clockwise traversal procedure to determine the left-right and 
front-back arrangements of each cube. 


It is not hard to find two disjoint labeled factors by inspection. The three labeled 
factors in Figure 2.23 all use the same 1-edge (between B and G). So no two of these 
three factors are disjoint. The easiest approach is to find one labeled factor, delete its 
edges, and look for a second labeled factor. If none is found, start with a different 
labeled factor. For example, suppose we use the labeled factor in Figure 2.23 a as our 
first factor. After deleting its edges from the graph in Figure 2.22, we easily find a 
second labeled factor. Such a second labeled factor is shown in Figure 2.25 (the reader 
should be able to find a second factor). 

We use the factor from Figure 2.23a to arrange the left and right sides, as shown 
in Figure 2.24. Next we rotate each cube about the centers of its left and right faces to 
arrange the front and back faces according to a clockwise traversal of the circuits in 
Figure 2.25. Starting with the 2-edge, we make / 2 * = W ,b\ = B, /* = If h* = G, 
/ 4 * = G. If — W and f* — b\ — R. Figure 2.26 shows the resulting solution of the 
Instant Insanity puzzle. Now go buy or borrow a set of Instant Insanity cubes and 
show your friends that you learned something really useful from this book! 



Figure 2.26 
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SUPPLEMENT EXERCISES 


Find all other labeled factors of the multigraph in Figure 2.22 besides the ones in 
Figure 2.23 and Figure 2.25. Give an argument in the process to show that there 
can be no other labeled factors. 


2. Find all Instant Insanity solutions (a disjoint pair of labeled factors) to the game 
with associated graph (a). 


(a) (b) (c) 




3. Find all Instant Insanity solutions to the game with the associated graph (b) shown 
above. 

4. Show carefully that Instant Insanity graph (c) shown above does not possess a 
solution, that is, a pair of disjoint labeled factors. 

5. (a) If a graph G has a Hamiltonian circuit, must it also have a factor? Prove true 

or give a counterexample. 

(b) Repeat part (a) for the case of G having an Euler cycle. 

6. Find a factor in the following graphs, if possible: 



7. Show that the definition of a factor in a graph implies that each vertex is incident 
to exactly two edges (or one self-loop) in a factor. 


CHAPTER 3 

TREES AND SEARCHING 



The most widely used special type of graph is a tree. There are two ways to define 
trees. In undirected graphs, a tree is a connected graph with no circuits. Alternatively, 
one can define a tree as a graph with a designated vertex called a root such that there 
is a unique path from the root to any other vertex in the tree. This second definition 
applies to directed as well as undirected graphs. Theorem 1 proves the equivalence 
of these two definitions. 

Intuitively, a tree looks like a tree. See the examples of trees in Figure 3.1. The 
vertex labeled a is a root for each of the trees in Figure 3.1. The equivalence of the 
above two definitions is proved in Exercise 5. To illustrate the equivalence, observe 
that in the tree in Figure 3.1a, the addition of an edge (g~, h) would create a circuit 
(when edge directions are ignored) and would simultaneously create a second path 
from root a to h via g. 

Trees are a remarkably powerful tool for organizing information and search pro- 
cedures. In this chapter, we survey some of the diverse settings in which trees can 
be used to represent and analyze search procedures. These settings include solv- 
ing puzzles (Section 3.2), solving the “traveling salesperson” problem (Section 3.3), 
and sorting lists (Section 3.4). In the next chapter, trees play a critical role in sev- 
eral of the network algorithms. In this first section, we present some basic proper- 
ties of trees and introduce some convenient terminology for working with trees. We 
prove several useful counting formulas about trees and illustrate some uses of these 
formulas. 

Observe that if a tree is an undirected graph (with no directed edges), then any 
vertex can be the root. For example, the tree in Figure 3. lb is drawn so that a appears 
to be a root, but the tree in Figure 3.1c, which is a redrawing of Figure 3.1£>, has no 
single vertex that is a natural root — that is, any vertex can be the root. 

In most of this chapter we will be using trees with directed edges. Following 
common terminology, we call a directed tree a rooted tree. A rooted tree T has a 
unique root, for if vertices a and b were both roots of T, then there would be paths 
from a to b and from b to a forming a circuit. An undirected tree is unrooted in the 
sense that it has no one particular root. An undirected tree can be made into a rooted 
tree by choosing one vertex as the root and then directing all edges away from the 
root. For example, to root the undirected tree in Figure 3.1£> at vertex a, we would 
simply direct all the edges from left to right. 
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Figure 3.1 

The standard way to draw a rooted tree T is to place the root a at the top of the 
figure. Then the vertices adjacent from a are placed one level below a, and so on, as 
in Figure 3.1a. We say that the root a is at level 0, vertices b and c in that tree are at 
level 1 , vertices d, e, /, and g in that tree are at level 2, and so forth. The level number 
of a vertex x in T is the length of the (unique) path from the root a to x. 

For any vertex x in a rooted tree T, except the root, the parent of x is the vertex 
y with an edge (v7x) into x (the unique edge directed into x). The children of x are 
vertices z with an edge directed from x to z. Children have level numbers 1 greater 
than x. Two vertices with the same parent are siblings. The parent-child relationship 
extends to ancestors and descendants of a vertex. In Figure 3.1a, vertex e has b as its 
parent, h and i as its children, d and/as its siblings, a as its other ancestor, and / as its 
other descendant. Observe that each vertex a: in a tree T is the root of the subtree of x 
and its descendants. For easy reference, there is a glossary of tree-related terminology 
at the end of this text. 

We now show that the two definitions of an (undirected) tree presented at the 
start of the chapter, and two additional characterizations of trees, are equivalent. 

Theorem 1 

Let T be a connected graph. Then the following statements are equivalent: 

(a) T has no circuits. 

(b) Let a be any vertex in T . Then for any other vertex x in T, there is a unique path 
P x between a and x . 

(c) There is a unique path between any pair of distinct vertices x, y in T . 

(d) T is minimally connected, in the sense that the removal of any edge of T will 
disconnect T . 

Proof 

First note that (b) and (c) are immediately equivalent; let vertex a in (b) be vertex y 
in (c). We shall show that (a) implies (d), (d) implies (c), and (c) implies (a). 

(a) 4 (d): Suppose that the removal of some edge (a, b) does not disconnect T . Then 
the graph T — (a, b) contains a path P between a and b. However, the edges of 
P plus edge (a, b) form a circuit, contradicting (a). 
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(d) ^ (c): Suppose there are two different paths Pi, Pi between two vertices a, b 
in T. Let e — (u, v) be the first edge on P\ , starting from a , that is not on Pi 
(possibly, u — a or v — b). T — e is disconnected by (d), so u and v must be in 
different components. But following P2 from u to h and then coming back along 
Pi from b to v creates a path Q connecting u and v. Note: To make sure Q does 
not repeat any edges, we actually should follow Pi until the first vertex w also on 
Pi is encountered (such a w might occur before we get to b) and then go back on 
Pi to v. 

(c) ^ (a): T cannot contain a circuit, because the edges of a circuit provide two 
different paths joining any two vertices that lie on the circuit. 4 

As minimally connected graphs, trees have as few edges as possible. The follow- 
ing theorem gives a simple formula for the number of edges in an 77-vertex tree. 

Theorem 2 

A tree with n vertices has n - 1 edges. 

Proof 

Assume that the tree is rooted; if undirected, make it rooted as described above. We 
can pair off a vertex x with the unique incoming edge (y",x) from its parent y. Since 
each vertex except the root has such a unique incoming edge, there are n — 1 nonroot 
vertices and hence n — 1 edges. ♦ 

Vertices of T with no children are called leaves of T. Vertices with children are 
called internal vertices of T. If every internal vertex of a rooted tree has m children, 
we call T an m- ary tree. If m = 2, T is a binary tree. 

Theorem 3 

Let T be an 777-ary tree with n vertices, of which i vertices are internal. Then, 
n = mi + 1. 

Proof 

Each vertex in a tree, other than the root, is the child of a unique vertex (its parent). 
Each of the i internal vertices has m children, and so there are a total of mi children. 
Adding the one nonchild vertex, the root, we have n = mi + 1. ♦ 

Corollary 

Let T be an 777-ary tree with 11 vertices, consisting of i internal vertices and / leaves. 
If we know one of 77, i, or /, then the other two parameters are given by the following 
formulas: 

(a) Given i, then / = (m — 1)7 + 1 and n — mi + 1 • 

(b) Given /, then i = (l — l)/(777 — 1) and n = (ml — T)/(m — 1). 

(c) Given 77, then i = (n — \)/m and / = [(777 — 1)77 + 1 ] / 777 . 
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Figure 3.2 

The proof of the corollary’s formulas follow directly from n = mi + 1 (Theorem 3) 
and the fact that l + i — n. Details are left to Exercise 8. 

Example 1 

If 56 people sign up for a tennis tournament, how many matches will be played in the 
tournament? 

The tournament proceeds in a binary-tree-like fashion. The entrants are leaves and 
the matches are the internal vertices. See Figure 3.2. Given / = 56 and m — 2, we de- 
termine i from part (b) of the corollary: i = (Z — I )/(m — 1) = (56 — l)/(2 — 1) = 55 
matches. ■ 

Example 2 

Suppose a telephone chain is set up among 100 parents to warn of a school closing. 
It is activated by a designated parent who calls a chosen set of three parents. Each of 
these three parents calls given sets of three other parents, and so on. How many parents 
will have to make calls? Repeat the problem for a telephone tree of 200 parents. 

Such a telephone chain is a rooted tree with 100 vertices. An edge corresponds 
to a call and an internal vertex corresponds to a parent who makes a call. Since we 
know n = 100 and that the tree is ternary (3-ary), part (c) of the corollary can be used 
to determine i, the number of callers: i = (n — 1 )/m — (100 — l)/3 = 33. 

When we repeat the computation for an organization of 200 people, we get i = 
(200— l)/3 = 66 1 internal vertices. By Theorem 3, a ternary tree must have a number 
of vertices n equal to 3i + 1 , for some i — that is, n= 1 (mod 3). But 200 = 2 (mod 3), 
and so we do not have a true ternary tree . Either 1 99 or 202 parents would give a ternary 
tree. As a practical matter, with 200 people there will be 66 parents who each make 
3 calls and one parent who makes just one call (one internal vertex with one child). ■ 

The height of a rooted tree is the length of the longest path from the root or, 
equivalently, the largest level number of any vertex. A rooted tree of height h is called 
balanced if all leaves are at levels h and h — 1. Balanced trees are “good” trees. The 
telephone chain tree in Example 2 should be balanced to get the message to everyone 
as quickly as possible. A tennis tournament’s tree should be balanced to be fair; 
otherwise some players could reach the finals by playing several fewer matches than 
other players. Making an m- ary tree balanced will minimize its height (Exercise 12). 
The tree in Figure 3.1 b, with a as root, is a balanced binary tree of height 3. 
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Theorem 4 

Let T he an m-ary tree of height h with / leaves. Then, 

(a) / < m h , and if all leaves are at height h,l =m h . 

(b) h > flog,,, /l , and if the tree is balanced, h = |"log m /"| . 


Proof 

The expression fr] denotes that the smallest integer > r; that is, [>] rounds r up to 
the next integer. 

(a) An m- ary tree of height 1 has m 1 = m leaves (children of the root). Now we use 
induction on h to show that an m- ary tree of height h has at most m h leaves, with 
I = m h if all leaves are at level h. An ra-ary tree of height h can be broken into 
m subtrees rooted at the m children of the root. See Figure 3.3. These m subtrees 
have height of at most h — 1 . By induction, each of these subtrees has at most 
nf leaves, and if all leaves are at height ft — 1 in the subtrees, each has exactly 
m h ~ l . The m subtrees combined have at most m x m h ~ l = m h leaves, and if all 
leaves are at height ft, there are exactly m h leaves. 

(b) Taking the logarithm base m on both sides of the inequality / < m h yields log m Z < 
ft. Since ft is an integer, we have |"log„,/"| < ft. If the tree is balanced with height 
ft, then the largest possible value for / is / = m h (if all leaves are at level ft), and 
the smallest possible value is / = {m h ~ l — 1 ) + m (with m leaves at level ft and 
the rest at level ft — 1). So m h ~ l < l <m h . Taking logarithms on both sides yields 
ft - 1 < log m l < ft, or ft = riog m / 1 . * 

The most common use of trees is in searching. The following two sequential 
testing examples, one a basic computer science problem and the other a logical puzzle, 
illustrate the use of trees in searching. 

Example 3 

Let us reexamine the dictionary look-up problem discussed in Example 2 of Section 
1.1. We want to identify an unknown word (number) X by comparing it to words in a 
set (dictionary) to which X belongs. This time our comparison test will be a three-way 
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H 


N 

Figure 3.4 a c e g i k m 

branch (less than, equal to, greater than). The test procedure can be represented by a 
binary, or almost binary, tree. If X were one of the first 14 letters of the alphabet, then 
Figure 3.4 is such a binary search tree. Each vertex is labeled with the letter tested 
at that stage in the procedure. The procedure starts by testing X against H. The left 
edge from a vertex is taken when X is less than the letter and the right edge when X is 
greater. Such a tree may have one internal vertex with just one child if the number of 
vertices is even (as is the case for vertex N in Figure 3.4). This tree is built by making 
the middle letter in the list (in this case, G or H ) the root. The left child of the root is 
the middle letter in the left subtree (in this case, D ), and so on. 

To minimize the number of tests needed to recognize any X, that is, the height 
of the search tree, we should make the tree balanced. Suppose that X were known to 
belong to a set of n “words.” What is the maximum number of tests that would be 
needed to recognize XI By the corollary to Theorem 3, a binary search tree with n 
vertices has 

; (2-l)n+l n+ 1 

2 2 

leaves. Then the maximum number of tests needed to recognize X is the height 
of a balanced |(n+ l)-leaf search tree. By Theorem 4, h = f ~log 2 [(n+ 1)/2]~| = 
\log 2 {n+ 1)1-1. . 



Example 4 

A well-known logical puzzle has n coins, one of which is counterfeit — too light or too 
heavy — and a balance to compare the weight of any two sets of coins (the balance can 
tip to the right, to the left, or be even). For a given value of n, we seek a procedure for 
finding the counterfeit coin in a minimum number of weighings. Sometimes one is 
told whether the counterfeit coin is too light or too heavy. If we are told that the fake 
coin is too light, how many weighings are needed for n coins? 

Our testing procedure will form a tree in which the root is the first test, the other 
internal vertices are the other tests, and the leaves are the solutions — that is, which 
coin is counterfeit. See the testing procedure in Figure 3.5 for eight coins. The coins 
are numbered 1 through 8, and the left edge is followed when the left set of coins in 
a test is lighter, the middle edge when both sets have the same weight, and the right 
edge when the right set of coins is lighter. Note that when weighing 4 and 5 (and 
already knowing that 1, 2, 3, 6, 7, 8 are not the light coin), the balance cannot be even. 
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The test tree is ternary, and with n coins there will be n leaves — that is, n different 
possibilities of which coin is counterfeit. Theorem 4 tells us that the test tree must 
have height at least |"log 3 n] to contain n leaves. For the light counterfeit coin problem, 
this bound can be achieved by successively dividing the current subset known to have 
the fake coin into three almost equal piles and comparing two of the piles of equal 
size, as in Figure 3.5. 

If the counterfeit coin could be either too light or too heavy, then the problem 
is harder (see Exercise 27). A particular coin will appear at 2 leaves in the test tree, 
once when the coin is determined to be too light and another time when too heavy. ■ 

Determining the number of leaves and height of more complex search trees is 
a major concern in the field of computer science called analysis of algorithms. In 
Section 3.4, we determine the number of leaves and height of search trees that arise 
when sorting a list of n items. Recurrence relations for the number of leaves and height 
of some other search trees are discussed in Section 7.2 in the enumeration part of this 
book. We conclude this section with a formula for the number of different undirected 
trees on n labeled vertices. Let the labels be the numbers 1 through n. For example, 
there are three different labeled trees on three labels. Each 3-label tree is a path of two 
edges with the difference being which of the three labels is the one middle vertex: 
1-2-3, 1-3-2, and 2-1-3 (switching the position of the two leaves does not produce 
a different tree). While the formula was first proved by Cayley, we present a simpler 
proof due to Prufer. 

Theorem 5 (Cayley, 1889) 

There are n"~ 2 different undirected trees on n labels. 

Proof 

Observe that n"~ 2 is the number of sequences of the n labels of length n — 2. We now 
construct a one-to-one correspondence between trees on n labels and (n — 2)-length 
sequences of the n labels. Recall that for simplicity, we let the labels be the numbers 
1 , 2 ,...,«. 

For any tree on n numbers, we form a sequence (.v | , .s '2 , . . . , 2 ) of length n — 2 

as follows. Let l\ be the leaf in the tree with the smallest number and let ,v 1 be the 
number of the one vertex adjacent to it. For the tree in Figure 3.6, the leaf with 
the smallest number is 1 and the number of its neighboring vertex is 6. So ,v 1 = 6. 
We delete leaf f from the tree and repeat this process. For the tree in Figure 3.6, h, 
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Figure 3.6 



the smallest numbered leaf in the tree after 1 is deleted is 4, and its neighbor is 2. 
So S2 = 2. Continuing we have / 3 = 5 and .V 3 = 2, then I4 = 2 and S4 = 3, then I5 = 6 
and S 5 = 3, and then / f , = 7 and ,v fi = 3. We stop when the remaining tree has been 
reduced to two leaves joined by an edge. The 6 -label sequence for the 8 -label tree in 
Figure 3.6 is thus ( 6 , 2, 2, 3, 3, 3). Such sequences are called Prufer sequences. 

Next we show that any such ( n — 2)-length sequence of n items defines a unique 
n-item tree. We simply reverse the procedure in the preceding paragraph used to 
build the sequence. Observe that leaves (vertices of degree 1) will never appear in the 
sequence. The first number of the sequence is the neighbor of the smallest numbered 
leaf. From what we just observed, this smallest numbered leaf is the smallest number 
that does not appear in the sequence. For the sequence ( 6 , 2, 2, 3, 3, 3), 1 is the smallest 
number not in the sequence, so 1 is the leaf with 6 as its neighbor. 

Now we set the smallest leaf (label 1 ) aside (its position in the tree — a leaf adjacent 
to the first item in the sequence — is determined) and we consider the first item (item 6 ) 
as a leaf that will be adjacent to some item in the remaining sequence. We then repeat 
the process of identifying the smallest leaf in the remaining (n — 1 (-label tree specified 
by the remaining ( n — 3)-label sequence. For the remaining sequence (2, 2, 3, 3, 3), 
label 4 is the smallest of the remaining numbers (label 1 has been deleted) not in the 
sequence, so item 4 is a leaf and is adjacent to label 2, the first number of the remaining 
sequence. Continuing in the reduced sequence (2, 3, 3, 3), label 5 is the smallest leaf 
of the remaining numbers and is adjacent to label 2. Now label 2 becomes a potential 
“leaf” with respect to the remaining sequence (3, 3, 3). Note that the available leaves 
are currently labels 2, 6 , 7, 8 . So label 2, the smallest available leaf, is adjacent to 
item 3. In the sequence (3,3), label 6 is adjacent to label 3. Finally, label 7 is adjacent 
to label 3. There remain only labels 3 and 8 , which must be adjacent to each other. 

The preceding construction of a labeled tree from the given Prufer sequence can 
be applied to any Prufer sequence. Thus the correspondence between / 7 -label trees 
and sequences of length n — 2 is one-to-one and the theorem is proved. ♦ 
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Summary of Exercises Exercises 3-18 present theory about trees. 
Exercises 19-29 involve various modeling problems with trees. 


1. Draw all nonisomorphic trees with 

(a) Four vertices (b) Five vertices (c) Six vertices 

2. Suppose a connected graph has 20 edges. What is the maximum possible number 
of vertices? 
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3. Show that all trees are 2-colorable. 

4 . Show that all trees are planar. 

5 . Show that an undirected connected graph G is a tree (i.e., has a unique path from 
root to each vertex) if any one of the following conditions holds: 

(a) G has n vertices and n — 1 edges. 

(b) G has fewer edges than vertices. 

(c) Removal of any edge disconnects G. 

6. Reprove Theorem 2 by using the fact that trees are planar (Exercise 4) and Euler’s 
formula (Theorem 2 in Section 1.4). 

7 . Show that any tree with more than one vertex has at least two vertices of degree 1 . 

8. Prove the following parts of the corollary to Theorem 3: 

(a) Part (a) (b) Part (b) (c) Part (c). 

9 . Reprove that / < m h in an in- ary tree of height h by counting the maximum 
possible number of choices at each internal vertex when building a path from the 
root to a leaf. 

10 . What is the maximum number of vertices (internal and leaves) in an in- ary tree 
of height hi 

11 . Show that the fraction of internal vertices among all vertices in an in- ary tree is 
about 1 /in. 

12 . For a given h, show that the height of an m-ary tree with k leaves is minimized 
when the tree is balanced. 

13 . What is the size of the largest and smallest numbers of vertices of degree 1 
possible in an n-vertex tree, for n > 2? 

14 . Show that a graph, not necessarily connected, is a tree if it has no circuits but the 
addition of any edge (between two existing vertices) always creates a circuit. 

15 . Show that the size of the largest independent set (defined in Example 5 of Section 
1.1) in an n-vertex tree is at least n/2. 

16 . A forest is an unconnected graph that is a disjoint union of trees. If G is an 
n-vertex forest of t trees, how many edges does it have? 

17 . Show that the sum of the level numbers of all / leaves in a binary tree is at least 
/ riog 2 /] , and hence the average leaf level is at least [Tog 2 /j • 

18 . Let T be an undirected tree. If the choice of vertex x to be the root yields a rooted 
tree of minimal height, then x is call a center of T. Show that any undirected tree 
has at most two centers. 

19 . Consider the problem of summing n numbers by adding together various pairs 
of numbers and/or partial sums, for example, {[(3 + 1) + (2 + 5)] + 9}. 

(a) Represent this addition process with a tree. What will internal vertices rep- 
resent? 

(b) What is the smallest possible height of an “addition tree” for summing 100 
numbers? 


102 


Chapter 3 Trees and Searching 


20. If 70s a balanced 5-ary tree with 80 internal vertices, 

(a) How many leaves does T have? 

(b) How many edges does T have? 

(c) What is the height of T1 

21. Every year, the NCAA basketball championship tournament features 64 teams 
that must compete for the national title. There are 16 teams from the East, 16 
teams from the South, 16 teams from the West and 16 teams from the Midwest. 
In each division (East, South, West, and Midwest), the teams compete in a single 
elimination tournament to determine the best team from their respective division. 
Then, the top East team plays the top South team and the top West team plays 
the top Midwest team. The winners of these games go on to play a final match 
to determine the champion. How many games are played in total? 

22. What type of search procedure is represented by the search tree below? 


B 

\ 

c 

23. A tree can be used to represent a binary code; a left branch is a 0 and a right 
branch a 1. The path to a letter (vertex) is its binary code. To avoid confusion, one 
sometimes requires that the initial digits of one letter’s code cannot be the code 
of another letter (e.g., if K is encoded as 0101, then no letter can be encoded as 0 
or 01 or 010). Under this requirement, which vertices in a tree represent letters? 
How many letters can be encoded using n-digit binary sequences? 

24. Suppose that each player in a tennis tournament (like the binary-tree tournament 
in Example 1) brings a new can of tennis balls. One can is used in each match 
and the other can is taken by the match’s winner along to the next round. Use 
this fact to show that a tennis tournament with n entrants has n— 1 matches. 

25. Consider a tennis tournament T (with the tree structure illustrated in Figure 3 . 2 ) 
with 32 entrants. 

(a) How many players are eliminated (lose) in the first two rounds of matches? 

(b) Suppose that the losers in the first two rounds of the tournament qualify for 
a losers’ tournament T' . How many players are eliminated in the first two 
rounds of matches in T'l Note that all tennis tournaments are balanced trees; 
the number of people playing matches in each round after the first round is 
a power of 2. 

(c) Suppose that the people who lose in the first two rounds of T' qualify for 
another losers’ tournament T". How many players are eliminated in the first 
two rounds of matches in T"1 

(d) Suppose that the people who lose in the first two rounds of T" qualify for 
another losers’ tournament and so on until finally there is just one grand 
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loser (the last tournament has two people). How many losers’ tournaments 
are required to determine this grand loser? 

26 . Suppose that a chain letter is started by someone in the first week of the year. 
Each recipient of the chain letter mails copies on to five other people in the next 
week. After six weeks, how much money in postage (40i/ a letter) has been spent 
on these chain letters? 

27. (a) Repeat Example 3 assuming now that only a two-way branch (less than, 

greater than or equal to) is available. Draw a balanced search tree for the first 
1 3 letters and determine the height of an n-letter search. 

(b) Suppose a two-way branching search tree for letters A, B, C, D , E is to 
take advantage of the following letter frequencies: A 20%, B 20%, C 30%, 
D 10%, E 20%. Build a two-way tree that minimizes the average number of 
tests required to identify a letter. 

28 . Repeat Example 4 for 20 coins with at most one too light. 

29 . Suppose we have four coins and possibly one coin is either too light or too heavy 
(all four might be true). 

(a) Show how to determine which of the nine possible situations holds with just 
two weighings, if given one additional coin known to be true. 

(b) Show that two weighings are not sufficient without the extra true coin. 

30 . Let T be an m- ary tree with n vertices, consisting of i internal vertices and / 
leaves. Suppose that m is an even number. Show that n always has to be an odd 
number. Give two (small) examples for the same value of m illustrating that i 
can be either even or odd. 

31 . In the proof of Theorem 5, we showed that a Prufer sequence (si , S 2 , . . . , .v„ — 2 ) 
uniquely described a tree on n items. Construct the trees with the following Prufer 
sequences. 


(a) (4, 5, 6, 2) (b) (2, 8, 8, 3, 5, 4) (c) (3, 3, 3, 3, 3, 3) 



3.2 SEARCH TREES AND SPANNING TREES 

Trees provide a natural framework for finding solutions to problems that involve a 
sequence of choices, whether hunting through a graph for a special vertex or finding 
one’s way out of a maze or searching for the cheapest solution to a vehicle rout- 
ing problem. Most of the problems in the two preceding chapters — isomorphism, 
Hamilton circuits, minimal colorings, and placing police on street corners — require 
tree-based searching for computerized solutions. By letting the sequential choices be 
internal vertices in a rooted tree and the solutions and “dead ends” be the leaves, we 
can organize our search for possible solutions. Whether searching in a graph or in a 
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maze or through all solutions to an optimization problem, the foremost concern in the 
search procedure is that it be exhaustive — that is, guaranteed to check all possibilities. 

In this section we present applications of tree searches that involve games rather 
than operations research applications. Most operations research tree enumeration 
problems involve very large trees and use special tree “pruning” algorithms. As an 
example of such applications, we solve a small Traveling Salesperson problem in 
Section 3.3. The next chapter, “Network Algorithms,” discusses four important opti- 
mization algorithms that implicitly use trees to search through graphs. 

We start with searching in a graph. In many applications, graph algorithms are 
needed to test whether a graph has a certain property, such as connectedness or 
planarity, or to count all occurrences of a given structure, such as circuits or complete 
subgraphs. The algorithms usually employ a spanning tree in searching among vertices 
and edges of a graph to check for these properties or structures. A spanning tree of a 
graph G is a subgraph of G that is a tree containing all vertices of G. Spanning trees 
can be constructed either by depth-first (backtrack) search or by breadth-first search. 

To build a depth-first spanning tree, we pick some vertex as the root and begin 
building a path from the root composed of edges of the graph. The path continues until 
it cannot go any further without repeating a vertex already in the tree. The vertex where 
this path must stop is a leaf. We now backtrack to the parent of this leaf and try to build 
a path from the parent in another direction. When all possible paths from this parent y 
and its other children have been built, we backtrack to the parent of v, and so on, until 
we come back to the root and have checked all other possible paths from the root. 

To build a breadth-first spanning tree, we pick some vertex x as the root and put 
all edges leaving x (along with the vertices at the ends of these edges) in the tree. 
Then we successively add to the tree the edges leaving the vertices adjacent from x, 
unless such an edge goes to a vertex already in the tree. We continue this process in 
a level-by-level fashion. 

It is important to note that if the graph is not connected, then no spanning tree 
exists. We thus have the following algorithm as a result. 

Algorithm to Test Whether an Undirected Graph Is Connected 

Use a depth-first or breadth- first search to try to construct a spanning tree. If all vertices 
of the graph are reached in the search, a spanning tree is obtained and the graph is 
connected. If the search does not reach all vertices, the graph is not connected. 

A formal proof that depth-first (breadth-first) searching does search all vertices 
in a connected graph is left as an exercise (Exercise 9). 

An adjacency matrix of an (undirected) graph is a (0,l)-matrix with a 1 in entry 
(i, j) if vertex x, and vertex x/ are adjacent; entry (i, j) is 0 otherwise. 

Example 1: Testing for Connectedness 

Is the undirected graph G whose adjacency matrix is given in Figure 3.7 a connected? 

Let us perform a depth-first search of G starting with x\ as the root. At each 
successive vertex, we pick the next edge on the tree to be the edge going to the lowest 
numbered vertex not already in the tree. So from x\ we go to X 2 - 
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Figure 3. 7 

(b) 







(C) 

From X2 

we go to X3. Since 

x 3 

is not adjacent to any other vertex besides xi 


and X 2 (which are already in the tree), we backtrack from x 3 to x 3 and continue the 
search from X2, going to x 3 , then to xj, and then to X4. At x 4 we backtrack to x 7 and 
go to X8. From xs, we must backtrack all the way back to xi. From x\, we go to xe- 
This finishes the search — all vertices have been visited. The spanning tree obtained 
is shown in Figure 3.1b. 

The result of a breadth-first search is shown in Figure 3.7c. ■ 

The computation time required to make a depth-first search of a graph is propor- 
tional to the number of edges in the graph (each edge in the spanning tree is traversed 
twice, and edges that cannot be used are tried just twice, once from each end vertex). 
If an undirected graph is not connected, then we can find its components (connected 
pieces) by applying a depth-first (or breadth-first) search at any vertex to find one 
component, then apply this search starting at a vertex not on the previous tree to 
find another component, and continue finding additional components until no unused 
vertices are left. 

We note one important property of a breadth-first search. A breadth-first spanning 
tree consists of shortest paths from the root to every other vertex in the graph. A proof 
of this claim is left to Exercise 8. If we only wanted to find a shortest path from the 
root to a particular vertex x in a graph, then the breadth-first search could stop as soon 
as x was reached (without trying to construct a full spanning tree). 

Now we apply the techniques of depth-first and breadth-first search to games. In a 
maze, the vertices will be the intersection points of paths. In puzzles, the vertices will 
be the different configurations of the puzzle and the edges will be the possible moves. 
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Figure 3.8 



Example 2: Traversing a Maze 

Consider the maze in Figure 3.8. We start at the location marked with an S and seek 
to reach the end marked with an E. 

We use a depth-first search. For mazes, there is a convenient rule of thumb (whose 
verification is left to Exercise 17) for constructing a depth-first search: stick to the 
right wall in the maze. When we come to a dead end, we follow the right wall to the 
end wall, along the end wall, and then backtrack along the left wall (now the right 
wall as we leave the dead end). When we come to a previously visited corner, we put 
an artificial (wiggly) dead-end wall to stop us from actually reaching that corner (as at 
S in Figure 3.8). In the maze in Figure 3.8 we use solid lines to indicate the (forward) 
pathbuilding and dashed lines for backtracking. Because the maze is easily searched 
directly, we have not drawn the search tree for this problem (in which S would be the 
root, other corners internal vertices, and dead ends and E the leaves). ■ 


If the tree of possible paths is large, then the breadth-first method quickly becomes 
unwieldy. The depth-first method that traces only one path at a time is much easier to 
use by hand or to program. Further, in cases where we need to find only one of the 
possible solutions, it pays to go searching all the way down a path for a solution rather 
than to take a long time building a large number of partial paths, only one of which in 
the end will actually be used. On the other hand, when we want a solution involving 
a shortest path or when there may be very long dead-end paths (while solution paths 
tend to be relatively short), then the breadth-first method is better. All the network 
search algorithms in the next chapter use breadth-first searches. 

Example 3: Pitcher- Pouring Puzzle 

Suppose we are given three pitchers of water, of sizes 10 quarts, 7 quarts, and 4 quarts. 
Initially the 10-quart pitcher is full and the other two empty. We can pour water from 
one pitcher into another, pouring until the receiving pitcher is full or the pouring 
pitcher is empty. Is there a way to pour among pitchers to obtain exactly 2 quarts in 
the 7- or 4-quart pitcher? If so, find a minimal sequence of pourings to get 2 quarts 
in the 7-quart or 4-quart pitcher. 
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( 0 , 4 ) ( 2 , 4 ) ( 3 , 4 ) ( 4 , 4 ) ( 6 , 4 ) 



( 10 , 0, 0 ) 



( 6 , 0 , 4 ) ( 3 , 7 , 0 ) 



( 6 , 4 , 0 ) ( 0 , 6 , 4 ) ( 0 , 7 , 3 ) ( 3 , 3 , 4 ) 


(b) 


Figure 3.9 

The positions, or vertices, in this enumeration problem are ordered triples ( a,b,c ), 
the amounts in the 10-, 7-, and 4-quart pitchers, respectively. Actually, it suffices 
to record only ( b , c), the 7- and 4-quart pitcher amounts, since a — 10 — h — c. A 
directed edge corresponds to pouring water from one pitcher to another. Let us draw 
the tree on a b, c-coordinate grid as shown in Figure 3.9 a. The grid is bounded 
by b = 7, c = 4, b + c = 10. Pouring between the 10- and 7-quart pitchers will be a 
horizontal edge, between 10- and 4-quart pitchers a vertical edge, and between 7- and 
4-quart pitchers a diagonal edge with slope — 1 . The beginning of the same tree is 
shown in the standard form in Figure 3.9 b. 

The root of this search tree is (0, 0). Since we want a minimal sequence of 
pourings (a shortest path in the search tree), we will use a breadth-first search. From 
the root, we can get to positions (7, 0) and (0, 4). From (7, 0), we can get to new 
positions (7, 3) and (3, 4), and from (0, 4), we can get to new positions (6, 4) and 
(4, 0). The tree built thus far is shown in Figure 3.9 b. 

From (7, 3), the only new position is (0, 3), and from (3, 4), the only new position 
is (3, 0). From (6, 4), the only new position is (6, 0), and from (4, 0), the only new 
position is (4, 4). We have now checked all paths of length 3. The only new moves 
now are from (4, 4) to (7, 1) and from (6, 0) to (2, 4). But (2, 4) has 2 quarts in the 
7-quart pitcher. So (0, 0) to (0, 4) to (6, 4) to (6, 0) to (2, 4) is a sequence of pourings 
to obtain 2 quarts. ■ 


Example 4: Missonaries — Cannibals Puzzle 

Suppose three missionaries and three cannibals must cross a river in a two-person 
boat. Show how this can be done so that at no time do the cannibals outnumber the 
missionaries on either side (unless there are no missionaries on a shore). 

We use vertices to denote the states of the puzzle when the boat is on one of the 
shores. We label each vertex with an ordered pair (N m , N c ), where N m is the number 
of missionaries and N c the number of cannibals on the near shore. When the boat is on 
the near shore, we put an asterisk after the label. We start at vertex (3,3)*. Each edge is 
labeled with an arrow showing the direction of the boat along with who is in the boat: 
one m for each missionary in the boat and one c for each cannibal in the boat. The 
only legal initial moves from vertex (3,3)* are for two cannibals or else a missionary 
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(3, 3)i 
(3. 1) 
(3, 2)i 
(3, 0) 
(3, l)i 
( 1 , 1 ) 
(2, 2)i 
(0, 2) 
(0, 3)i 
( 0 , 1 ) 
(0, 2)i 

Figure 3.10 to.o) 


and a cannibal to cross the river with a cannibal left on the far shore and the other 
rower returning with the boat to the near shore. The readers should try to solve this 
puzzle for themselves. We show a solution to the game in Figure 3.10, which starts 
with two cannibals in the first crossing. The key step occurs on the sixth move when 
two people (a missionary and a cannibal) cross from the far shore to the near shore. ■ 


Suppose we have built a tree to provide the framework for searching or organizing 
information. Searching for a particular vertex or processing information in the tree 
normally involves a depth-first type of traversal of the spanning tree. However, there 
are several times during a traversal when internal vertices can be checked. A preorder 
traversal of a tree is a depth-first search that examines an internal vertex when the 
vertex is first encountered in the search. A postorder traversal examines an internal 
vertex when last encountered (before the search backtracks away from the vertex and 
its subtree). If a tree is binary, we can define an inorder traversal that checks an 
internal vertex in between the traversal of its left and right subtrees. Figures 3.11a 


1 8 




Figure 3.11 
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Figure 3.12 


and 3.1 lb display numberings of the vertices of the tree in Figure 3.1 la according 
to preorder and postorder traversals, respectively. (We assume the left child is visited 
before the right child at each internal vertex.) 

An important property of preorder and postorder traversals is that in a preorder 
traversal a vertex precedes its children and all its other descendants, whereas in a 
postorder traversal a vertex follows its children and all its other descendants. 

We now give examples of each type of traversal. In the binary search tree example 
of a dictionary look-up (Example 2 in Section 3.1), the alphabetical order of the 
vertices corresponds to inorder traversal. In searching through a graph for a vertex 
with a specified label, we should test each new vertex as it is encountered on a depth- 
first search to see if it has the desired label. There is no advantage to postponing such 
testing. Thus, in searching for a special vertex, vertices should be examined according 
to a preorder traversal. 

We demonstrate the need for postorder traversal with an arithmetic tree. The 
arithmetic expression 


(( a + b) + c)x (((q -t-M + e)) -f) + g) 

can be decomposed into a binary tree as shown in Figure 3.12. The internal vertices are 
arithmetic operations and the leaves variables. To evaluate this arithmetic expression, 
we need to execute the operations specified by internal vertices according to a pos- 
torder traversal of the arithmetic tree — that is, we cannot perform an operation until 
the subexpressions represented by the operation vertex’s two subtrees are evaluated. 



3.2 EXERCISES 


Summary of Exercises The first six exercises involve building 
spanning trees and the next five involve properties of spanning trees. Exercises 12-25 
are based on the puzzle examples in this section. Exercises 26-27 involve traversals. 


1. Find depth-first spanning trees for each of these graphs; 

(a) Kg (a complete graph on eight vertices) 

(b) The graph in Figure 2.5 in Section 2.2 
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(c) The graph in Figure 2.10 in Section 2.3 

(d) A'T j (a complete bipartite graph) 


2. Find breadth-first spanning trees of each of the graphs in Exercise 1 . 

3. Find all spanning trees (up to isomorphism) in the following graphs: 

(a) Figure 2.4 (b) K 4 (c) 



(d) ^ 3,3 (a complete bipartite graph) 

4. Test the graph whose adjacency matrix is given below to see if it is connected. 


Xl 

x 2 
x 3 

X4 

x 5 
Xf, 

Xl 
Xg 

5. Consider an undirected graph with 25 vertices x 2 , x 2 x 2 6 with edges (x,- , Xj) 

if and only if integers i and j have a common divisor. How many components 
does this graph have? Find a spanning tree for each component. 

6. Show that in an n-vertex graph, a set of n — 1 edges that form no circuits is a 
spanning tree. 

7. Show that a connected undirected graph with just one spanning tree is a tree. 

8. Show that a breadth-first spanning tree contains shortest paths from the root to 
every other vertex. 

9. (a) Prove that a depth-first search reaches all vertices in an undirected connected 

graph. 

(b) Repeat part (a) for breadth-first search. 

10. (a) Show that the height of any depth-first spanning tree of a graph G starting 
from a given root a must be at least as large as the height of a breadth-first 
spanning tree of G with root a. 

(b) By starting with different roots, it might be possible for a depth-first spanning 
tree of G to have a smaller height than a breadth-first spanning tree of G. 
Find a graph in which this can happen. 


X\ X 2 Xg X4 X; Xf, Xi Xg 


0 0 
0 0 
1 1 
0 0 
0 1 
1 0 
0 1 
1 0 


1 0 
1 0 
0 1 
1 0 
0 1 
0 1 
0 1 
0 0 


0 1 
1 0 
0 0 
1 1 
0 0 
0 0 
0 0 
1 0 


0 1 
1 0 
0 0 
1 0 
0 1 
0 0 
0 1 
1 0 


11. A cut-set is a set S of edges in a connected graph G whose removal disconnects 
G, but no proper subset of S disconnects G. Show that any cut- set of G has at 
least one edge in common with any spanning tree of G. 
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12. (a) Use a depth-first search to find your way through this maze from S to E. 

(b) How many ways through this maze (without cycles or backing up) are there? 

5 


E 

13. Repeat Example 3 using a depth-first search. 

14. Perform a breadth-first search of the graph in Figure 3.8 to find the shortest path 
from S to E. 

15. Find another (actually shorter) way in Example 3 to get 2 quarts in one pitcher. 

16. (a) Repeat Example 3 with pitchers of size 8, 5, and 3 with an objective of 4 

quarts in one pitcher. 

(b) Repeat Example 3 with pitchers of sizes 12, 8, and 5, and an objective of 2 
quarts in one pitcher. 

17. Show that the stick-to-the-right-hand-wall rule will always get one out of a two- 
dimensional maze. 

18. Use a depth-first search in Example 3 to show that any amount between 0 and 10 
quarts can be obtained in one of the pitchers. 

19. Suppose that a dog, a goat, and a bag of tin cans are to be transported across a 
river in a ferry that can carry only one of these three items at once (along with a 
ferry driver). If the dog and goat cannot be left alone on a shore when the driver 
is not present, nor can the goat and tin cans, find a scheme for getting all across 
the river. 

20. Three jealous wives and their husbands come to a river. The party must cross the 
river (from near shore to far shore) in a boat that can hold at most two people. Find 
a sequence of boat trips that will get the six people across the river without ever 
letting any husband be alone (without his wife) in the presence of another wife. 

21. Repeat Exercise 20 this time with four jealous wives and their husbands, if possible. 

22. Do Exercise 20, this time with four jealous wives and husbands and a three- 
person boat. 

23. Repeat Example 4 with the additional condition that only one of the cannibals 
can row (all the missionaries can row). 

24. The following is a variation of the pitcher-pouring problem. An hourglass is 
a timepiece in which a specified period of time will elapse as sand in the top 
half empties through the narrow opening to the bottom half. If an hourglass is 
designed to take 10 minutes for the sand to flow down and after, say, 7 minutes 
the glass is turned upside down, then the sand flow will be reversed and it will 
take 7 more minutes for the sand to empty. Suppose you have two hourglasses, 
a 4-minute hourglass and a 7-minute hourglass. Describe a scheme for starting 
them and turning them upside down so as to measure out a 9-minute period of 
time. {Hint: In the beginning, you start one or both hourglasses; then a decision 
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must be made whenever an hourglass empties whether to restart it and whether 
to start or invert the other hourglass.) 

25. A group of four people have to cross a bridge that will accommodate only two 
people at a time. It is night and there is one flashlight. Any party who crosses, 
either one or two people, must have the flashlight with it. (The flashlight must 
be walked back and forth; it cannot be thrown.) Find a way for all four to get to 
the other side of the bridge in 17 minutes, given that person A takes 1 minute to 
cross the bridge, B takes 2 minutes, C takes 5 minutes, and D takes 10 minutes. 
When two people cross, they must go at the speed of the slower person. 

26. List the vertices in order of a preorder traversal and a postorder traversal of 

(a) Figure 3.1a (b) Figure 3. lb (c) Figure 3.4 

27. Generalize the arithmetic tree in Figure 3.12 to include unary operations 
such as inverses or sin( ). Give the tree for the following expression: 
sin(((a + (( b x c) -1 + ((a + d) x e))) — (c + e)) + (a — b)~ l ) 

28. Let T be a spanning tree in a connected undirected graph G. Show that when 
any non-tree edge is added to T, a unique circuit results. 

29. Prove that any spanning tree T' of a graph G can be converted into any other 
spanning tree T" of G by a sequence of spanning trees 7) , T 2 , ... , T m where 
T" = T \ , T" — T m , and 7). is obtained from 7*_i by removing one edge of 7)_ 1 
that was in T' and adding one edge in T' (in is the number of edges by which 
T' and T" differ). (Hint: Use Exercise 28 and induction on m .) 

30. Suppose that during a preorder traversal of a binary tree T, we write down a 1 
for each internal vertex and a 0 for each leaf in the traversal, building a sequence 
of Is and Os. If T has n leaves, the sequence will have n Os and n — 1 Is. We call 
this sequence the characteristic sequence of T. (Such a sequence determines a 
unique tree.) 

(a) Find the binary tree with the characteristic sequence 110100110100100. 

(b) Prove that the last two digits in any characteristic sequence are 0s (assuming 
n > 2). 

(c) Prove that a binary sequence with n 0s and n — 1 Is, for some n, is a 
characteristic sequence of some binary tree if and only if the first k digits of 
the sequence contain at least as many Is as 0s, for 1 < k < 2n — 2. 

31. (a) Write a program for building a depth-first spanning tree of a graph whose 

adjacency matrix is given. 

(b) Repeat part (a) for a breadth-first spanning tree. 

32. Use a breadth-first inverted spanning tree T (edges directed from children to par- 
ents) to build an Euler cycle in a directed graph possessing an Euler circuit as fol- 
lows. Starting at the root of T, trace out a path such that at any vertex x we choose 
only the edge in T to the parent of x (in T) if there are no other unused edges leaving 
x at this stage. Verify the correctness of the algorithm. Program this algorithm. 
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3.3 THE TRAVELING SALESPERSON PROBLEM 

In this section, we illustrate the use of trees in graph optimization problems of oper- 
ations research. The traveling salesperson problem seeks to minimize the cost of the 
route for a salesperson to visit a set of cities and return to home. One seeks a minimal- 
cost Hamilton circuit in a complete graph having an associated cost matrix C. Entry 
Cy in C is the cost of using the edge from the /th vertex to the /th vertex. The traveling 
salesperson problem arises in many different guises in operations research, and is 
famously difficult (it is an NP-complete problem; see Appendix A. 5). One example is 
planning the movement of an automated drill press making holes at specified locations 
on printed circuit boards. 

We present two approaches to solving the traveling salesperson problem. Both 
approaches use trees, but in two very different ways. The first approach is to system- 
atically consider all possible ways to build Hamilton circuits in search of the cheapest 
one. It uses a “branch and bound” method to limit the number of different vertices in 
the search tree that must be inspected in search of a minimal solution. Each vertex in 
the tree will represent a partial Hamilton circuit, and each leaf a complete Hamilton 
circuit. Note that a complete graph on n vertices has (n — 1 )! different Hamilton cir- 
cuits; for example, a 50-city (vertex) problem has 49! «6 x 10 62 possible circuits. 
The “branch and bound” method reduces substantially the number of circuits that 
must be checked, although not enough to make it practical to solve large traveling 
salesperson problems. 

This leads to another strategy, our second approach, which is to construct a 
near-minimal solution using an algorithm that is quite fast. The huge amount of time 
required to find exact solutions to problems that involve enumeration of very large 
trees, such as the traveling salesperson problem, has led researchers to concentrate 
on heuristic, near-optimal algorithms for such problems. 

Note that the numbers in the cost matrix may themselves only be estimates — e.g., 
travel times by truck — and so any solution with these numbers is only an estimate. 
Finding a minimal coloring of any arbitrary graph or testing for isomorphism are 
other problems we have seen that have underlying search trees with similarly huge 
numbers of possible solutions. Near-minimal coloring algorithms have received much 
attention, but unfortunately, a “near-isomorphism” is usually of no value. 

We consider a small traveling salesperson problem with four vertices x\, X 2 , X 3 , 
X4. Let the cost matrix for this problem be the matrix in Figure 3.13. Entry Cy is the 
cost of going from vertex (city) i to vertex j. (Note that we do not require cy — Cji.) 
The infinite costs on the main diagonal indicate that we cannot use these entries. A 
Hamilton circuit will use four entries in the cost matrix C, one in each row and in 
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each column, such that no proper subset of entries (edges) forms a subcircuit. This 
latter constraint means that if we choose, say, entry C23, then we cannot also use C32, 
for these two entries form a subcircuit of length 2. Similarly, if entries C23 and C31 are 
used, then C12 cannot be used. 

We first show how to obtain a lower bound for the cost of this traveling salesperson 
problem. Since every solution must contain an entry in the first row, the edges of a 
minimal tour will not change if we subtract a constant value from each entry in the 
first row of the cost matrix (of course, the cost of a minimal tour will change by this 
constant). We subtract as large a number as possible from the first row without making 
any entry in the row negative; that is, we subtract the value of the smallest entry in 
row 1, namely 3. Do this for the other three rows also. We display the altered cost 
matrix in Figure 3.14. 

All rows in Figure 3.14 now contain a 0 entry. After subtracting a total of 3 + 
3 + 5 + 4= 15 from the different rows, a minimal tour using the cost data in Figure 
3.15 will cost 15 less than a minimal tour using original cost data in Figure 3.14. Still, 
the edges of a minimal tour for the altered problem are the same edges that form a 
minimal tour for the original problem. 

In a similar fashion, we can subtract a constant from any column without changing 
the set of edges of a minimal tour. Since we will want to avoid making any entries 
negative, we consider subtracting a constant only from columns with all current entries 
positive. The only such column in Figure 3.14 is column 4, whose smallest value is 
1. So we subtract 1 from the last column in Figure 3.14 to get the matrix in Figure 
3.15. Every row and column in Figure 3.15 now contains a 0 entry. 

The cost of a minimal tour using Figure 3.15 has been reduced by a total of 
15 + 1 = 16 from the original cost using Figure 3.13. We can use this reduction of 
cost to obtain a lower bound on the cost of a minimal tour: A minimal tour using the 
costs in Figure 3.15 must trivially cost at least 0, and hence a minimal tour using Figure 
3.13 must cost at least 16. In general, the lower bound for the traveling salesperson 
problem equals the sum of the constants subtracted from the rows and columns of 
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the original cost matrix to obtain a new cost matrix with a 0 entry in each row and 
column. 

Now we are ready for the branching part of the “branch and bound” method. We 
look at an entry in Figure 3.15 that is equal to 0. Say c\ 2 - Either we use c \2 or 
we do not use c\ 2 - We “branch” on this choice. In the case that we do not use c 12 , 
we represent the no-cn choice by setting cn = 00 . The smallest value in row 1 of 
the altered Figure 3.15 is now c 14 = 3, and so we can subtract this amount from row 
1. Similarly, we can subtract 1 from column 2. If we do not use entry c n, we obtain 
the new cost matrix in Figure 3.16. Hence, any tour for the original problem (Figure 
3.13) that does not use c \2 must cost at least 16+ (3 + 1) = 20. 

If we use c \2 to build a tour for Figure 3.15, then the rest of the tour cannot use 
another entry in row 1 or column 2 ; also entry C 21 must be set equal to 00 (to avoid 
the subcircuit xi — X 2 — Xi). The new smallest value of the reduced matrix in row 2 
is 1, and so we subtract 1 from row 2 to obtain the cost matrix in Figure 3.17. The 
lower bound for a tour in Figure 3.17 using C 12 is now 16 + 1 = 17. 

Since our lower bound of 17 is less than the lower bound of 20 when we do 
not use C 12 , we continue our consideration of tours using C 12 by determining bounds 
associated with whether or not to use a second entry with value 0. We will continue 
to extend these partial tours using C 12 as long as the lower bound is <20. If the lower 
bound were to exceed 20 , then we would have to go back and consider partial tours 
not using C 12 . Our tree of choices for this problem will be a binary tree whose internal 
vertices represent choices of the form: use entry c/j or do not use c, ; - (see Figure 3.19). 
At any stage, as long as the lower bounds for partial tours using c !; - are less than the 
lower bound for tours not using c,j , we do not need to look at the subtree of possible 
tours not using Cij . 

We extend a tour using C 12 by considering an entry with value 0 in Figure 3.17. 
This next 0 entry need not connect with C 12 — that is, need not be of the form c, i or 
C 2 j — but for simplicity we shall pick an entry in row 2. The only 0 entry in row 2 of 
Figure 3.17 is C 24 . 

Again we have the choice of using C 24 or not using C 24 . Not using C 24 will increase 
the lower bound by 2 (the smallest entry in row 2 of Figure 3.17 after we set C 24 = 00 ; 
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column 4 still has a 0). Using C 24 will not increase the lower bound, and so we further 
extend the partial tour using c 24 along with c\ 2 . Again we delete row 2 and column 4 
in Figure 3.17 and set C 41 = 00 (to block the subcircuit x\— X 2 — X 4 — xi). Figure 3.18 
shows the new remaining cost matrix (all rows and columns still have a 0 entry). 

Now the way to finish the tour is clear: use entries C43 and C31 for a complete tour 
X\—x 2—X4—X3—X1. We actually have no choice since not using either of C43 or C31 
forces us to use an 00 (which represents a forbidden edge). Since C43 = C31 = 0, this 
tour has a cost equal to the lower bound of 17 in Figure 3.18. Further, this tour must be 
minimal since its cost equals our lower bound. Rechecking the tour’s cost with the orig- 
inal cost matrix in Figure 3.13, we have C12 + C24 + C43 + C31 = 3 + 5 + 4 + 5 = 17. 

We summarize the preceding reasoning with the decision tree in Figure 3.19 
summarizing choices we made and their lower bounds (L.B.). 

One general point should be made about how to take best advantage of the branch- 
and-bound technique. At each stage, we should pick as the next entry on which to 
branch (use or do not use the entry), the 0 entry whose removal maximizes the increase 
in the lower bound. In Figure 3. 15, a check of all 0 entries reveals that not using entry 
C43 will raise the lower bound by 3 + 3 = 6 (3 is the new smallest value in row 4 
and in column 3). So C43 would theoretically have been a better entry than C12 to 
use for the first branching, since the greater lower bound for the subtree of tours not 
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using C 43 makes it less likely that we would ever have to check possible tours in that 
subtree. 

Now consider the 6 x 6 cost matrix in Figure 3.20. We present the first few 
branchings in Figure 3.21 for this cost matrix. We circle the 0 entry on which we 
branch at each stage. 

The branch-and-bound method is used not only to solve optimization problems. 
It is also a central tool in artificial intelligence. Computers that play chess make a 
tree of possible future moves and then assign some sort of “value” to the resulting 
chessboard situations. A move that leads to a bad situation gets a very high value, 
and future moves from this situation are not pursued. At more promising situations, 
all reasonable next moves are considered and their resulting situations evaluated. 
The more powerful the computer, the more moves into the future it can examine, 
although the procedure for evaluating positions is just as important as the machine’s 
speed. 

Next we present a quicker algorithm for obtaining good (near-minimal) tours, 
when the costs are symmetric (i.e., c, ; - = c ; ,) and the costs satisfy the triangle 
inequality — cik < Cij + Cjk ■ These two assumptions are satisfied in most traveling 
salesperson problems. After describing the algorithm and giving an example of its 
use, we will prove that at worst the approximate algorithm’s tour is always less than 
twice the cost of a true minimal tour. A bound of twice the true minimum may sound 
bad, but in many cases where a ballpark figure is needed (and the exact minimal tour 
can be computed later if needed), such a bound is quite acceptable. Furthermore, in 
practice our approximate algorithm finds a tour that is close to the true minimum. 
This algorithm uses a successive nearest-neighbor strategy. 

Approximate Traveling Salesperson Tour Construction 

1. Pick any vertex as a starting circuit C\ consisting of 1 vertex. 

2. Given the &- vertex circuit Ck, k > 1, find the vertex Zk not on Ck that is closest 
to a vertex, call it y k , on C k - 

3. Let Ck + 1 be the k + 1 -vertex circuit obtained by inserting Zk immediately in front 
of y k in Ck- 

4 . Repeat steps 2 and 3 until a Hamilton circuit (containing all vertices) is formed. 
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Figure 3.21 Tree of traveling salesperson partial solutions 
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Example 1: Approximate Traveling Salesperson Tour 

Let us apply the preceding algorithm to the 6 -vertex traveling salesperson problem 
whose cost matrix is given in Figure 3.22. Name the vertices x \ , X 2 , X 3 , X 4 , X 5 , X 6 - 
We will start with xi as C\. Vertex X4 is closest to xi, so C 2 is x\— X4— Xi. Ver- 
tex X3 is the vertex not in C 2 closest to a vertex in C 2 — namely, closest to X4; thus 
C 3 = xi— X3— X4— x\. There are now two vertices, X2 and X6, 3 units from vertices in 
C 3 . Suppose we pick X2. It is inserted before X3 to obtain C 4 = X\ — X 2 — X3— X4— x t . 
Vertex x ( , is still 3 units from xi, so we insert X 6 before xi, obtaining C 5 = 
xi— X2— X3— X4— X6— Xi. Finally, X5 is within 4 units of X3 and x ( , . Inserting X5 be- 
fore X6, we obtain our near-minimal tour C’c, — xi — X2— X3— X4— X5— x$— xi, whose 
cost we compute to be 19. 

In this case, the length of this tour obtained with the Approximate Tour Construc- 
tion algorithm is quite close to the minimum (which happens to be 18). The length 
of the approximate tour typically depends on the starting vertex. If we suspected that 
our approximate tour was not that close to an optimal length, then by trying other 
vertices as the starting vertex (that forms C 1 ) and applying the algorithm, we will 
get other near-minimal tours. Taking the shortest of this set of tours generated by 
the Approximate Tour Construction would give us an improved estimate for the true 
minimal tour. ■ 

Theorem 

The cost of the tour generated by the approximate tour construction is less than twice 
the cost of the minimal traveling salesperson tour. 


Proof Optional 

Suppose we are successively building the ^-vertex circuits C/ ( according to the ap- 
proximate tour construction. Let S* be a subset of edges in the true minimal tour C* 
that connect the vertices in C* to the other vertices in G < S* is described precisely 
below). When k = 1, we let S 1 consist of C*—e*. where e* is the costliest edge in C*. 
Since C\ is a single vertex and C*—e* is a Hamilton path, .S) = C*—e* will connect 
Ci with the rest of G. 

For concreteness, see the 8 -vertex graph in Figure 3.23a, where C\ is the vertex 
X 6 and .S ' 1 = C*—e* is the Hamilton path of solid edges. If zi = X 3 and so C 2 is 
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Figure 3.23 


xe—xy—xf,, then we set So — Si—(x$, x fl ) (C 2 is shown by dashed edges and So by 
solid edges in Figure 3 . 23 b). In general, we will obtain 5 / t+i from S( : by removing 
the brst edge on the path in S/ ; from Ck to the new vertex z.k being added to C*. In 
Figure 3 . 23 c, suppose that z 2 = vg and y 2 = x (] . So x x is inserted into Co between X3 
and X6 to obtain C 3 = x<-,— xs— X3 — xr , . Edge (xz, x 7 ) is removed from So to get So , 
since (xz , x 7 ) is the brst edge on the path in So from Co (specifically xz ) to x x . 

By the shortest edge rule for picking z 2 = x x and y 2 = x 6, we know that Cf,g [the 
cost of edge (x6, xs)] is the smallest cost among all edges between C 2 (= X(,~ X3— x&) 
and the rest of G. Thus C68 < [where (x 6 , x 7 ) was the edge removed from So_ 
to get So ] . Using this fact and the triangle inequality, we shall now prove that in- 
serting X8 into C 2 to get C3 has a net increase in cost < 2 q ) 7 . The increase of 
C3 over C 7 is C68 + C38 — C36, since edges (x6, xs) and (x3,xs) replace (x3,X6). 
But by the triangle inequality, C38 < C36 + C68, or equivalently C38— C36 < C68- Thus 
C68 + (C38-C36) < c 6 8 + c 6 8 < 2 c 67 , as claimed. 

This same argument can be applied when we insert each successive Zk into C* to 
prove that the increase of Ct+i over Ck is at most twice the cost of the edge dropped 
from Sk to get So+\ . Starting from C\ and repeating this bound on the insertion cost, 
we see that C n , the final near-optimal Hamilton circuit, is bounded by twice the sum 
of the costs of the edges in S \ — C*—e*, which is less than twice the cost of the 
minimal traveling salesperson tour C*. ♦ 



3.3 EXERCISES 


1 . Subtract the value of each nondiagonal entry in Figure 3.13 from 10 . Now solve 
the traveling salesperson problem for this new cost matrix. 


2. Solve the traveling salesperson problem for the cost matrix in Figure 3 . 20 . 

3. Write a computer program to solve a six-city traveling salesperson problem and 
use it for the cost matrix in Figure 3 . 22 . 


4. Use ad hoc arguments to show that the cost of a minimal tour for the cost matrix 
in Figure 3.22 is 18 . 


5 . Every month a plastics plant must make batches of five different types of plastic 
toys. There is a conversion cost Cy in switching from the production of toy i to 
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toy j, shown in the matrix. Find a sequence of toy production (to be followed for 
many months) that minimizes the sum of the monthly conversion costs. 
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6. The assignment problem is a matching problem with n people and n jobs and a 
cost matrix with entry Cy representing the “cost” of assigning person i to job /'. 
The goal is a one-to-one matching of people to jobs that minimizes the sum of 
the costs. Set all diagonal entries in Figure 3.13 equal to 5 and solve this 4x4 
assignment problem by a branch-and-bound approach. How does an assignment 
problem differ from a traveling salesperson problem? 

7 . Use the approximate algorithm to find approximate traveling salesperson tours 
for the cost matrices in the following (use just entries above the main diagonal): 

(a) Figure 3.13 (b) Figure 3.20 (c) Exercise 5 


8. Consider the following rule for building approximate traveling salesperson tours. 
Starting with a single-vertex tour 7j, successively add the vertex whose insertion 
into 7j to form Tjt+i minimizes the increase in cost — that is, if x is inserted 
between Xk and Xk+ 1 , then Ckr + c r (,t+i) — Q(/t+i) should be minimal over all 
choices of x r and Xk . 

(a) Can you prove an upper bound on this method similar to the one found for 
the approximate algorithm (making the same assumptions)? 

(b) Apply this method to the cost matrix in (i) Figure 3.20 (using just entries 
above the main diagonal) and (ii) Figure 3.22. 

9 . Find a 3 x 3 cost matrix for which two different initial lower bounds can be ob- 
tained (with different sets of 0 entries) by subtracting from the rows and columns 
in different orders. 

10 . Make up a 5 x 5 cost matrix for which the Approximate Tour Construction finds 
the following: 

(a) An optimal tour 

(b) A fairly costly tour (at least 50% over the true minimum). 



3.4 TREE ANALYSIS OF SORTING ALGORITHMS 

One of the basic combinatorial procedures in computer science is sorting a set of 
items. Items are sorted according to their own numerical value or the value of some 
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associated variable. To simplify our discussion, we will assume that all items have 
distinct numerical values. 

Some of the sorting procedures we mention will use binary trees explicitly. All the 
procedures use binary testing trees implicitly — that is, the procedures use a sequence 
of tests that compare various pairs of items. A binary testing tree model of sorting has 
one important consequence. The tree must have at least n\ leaves if there are n items, 
since there are n\ possible permutations, or rearrangements, of the set. By Theorem 
4 in Section 3.1, the height of a binary testing tree must be at least [~log 2 « !] , which is 
approximately n log 2 n. Thus we obtain the theorem. 

Theorem 

In the worst case, the number of binary comparisons required to sort n items is at least 
0(n log 2 n). 

The theorem refers to the worst case, because certain sorting algorithms will sort 
some sets very quickly (corresponding to short paths in the binary testing tree). One 
can also show that the average number of binary comparisons required to sort n items 
is at best of order 0(n log 2 n) (see Exercise 3). A function gin) is said to be of order 
0( f(n)) if, for large values of n, g(n) < cfin), for some constant c. 

The best known sorting algorithm is called bubble sort, so named because small 
items move up the list the way bubbles rise in a liquid. It is compactly written as 

FOR m <— 2 TO n DO 

FOR i<- n STEP -1 TO m DO 

IF THEN interchange items A± and 

This procedure will always require (n — 1) + (n — 2) H — • + 1 = \n(n — 1) 
binary comparisons. Thus this procedure requires 0(n 2 ) comparisons, as opposed 
to the theoretical bound of Oin log 2 n) comparisons. To see how much faster 
0(n 2 ) grows than 0(n log 2 n ), observe that for n = 50, n 2 — 2500, and n log, n & 
300; and for n = 500, n 2 — 250,000, and n log 2 n « 4500. 

The simplest-to-state sorting procedure that achieves the 0(n \og 2 n) bound is 
merge sort. It recursively subdivides the original list and successive sublists in half 
(or as close to half as possible) until each sublist consists of one item. Then it succes- 
sively merges the sublists in order. The subdivision process is naturally represented 
as a balanced binary tree and the merging as a reflected image of the tree. 

Example 1: Merge Sort 

Sort the list 5, 4, 0, 9, 2, 6,1, 1, 3, 8 using a merge sort. 

The subdivision tree and subsequent ordered merges are shown in Figure 3.24. ■ 

To analyze the number of binary comparisons in a merge sort, we make the 
simplifying assumption that n = 2 r , for some integer r. Then the subdivision tree will 
have sublists of size 2 r_1 at the level- 1 vertices. In general, there will be 2 r ~ k items 
in the sublists at level k. At level r, there are leaves each with one item. 
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Figure 3.24 


0, 1, 2, 3, 4, 5, 6, 7, 8, 9 


In the merging tree, first the pairs of leaves are ordered at each vertex on level 
r — 1; this requires one binary comparison (to see which leaf item goes first). See 
Figure 3.24. In general, at each vertex on level k we merge the two ordered sublists 
(of 2 r ~ k ~ l items) of the two children into an ordered sublist of 2 r ~ k items; this 
merging will require 2 r ~ k — 1 binary comparisons (verification of this number is left 
as an exercise). Finally, the two ordered sublists of 2 r_1 items at level 1 are merged at 
the root. The number of binary comparisons at all the vertices on level k is 2 k (2 r ~ k — 1) 
since there are 2 k different vertices on level k. Summing over all levels, we compute 
the total number of binary comparisons 

r — 1 r — 1 

2 k (2 r ~ k - 1) = ^(2 r - 2 k ) 
k=0 k = 0 

r - 1 r — 1 

=E 2 '-Z 2 ‘ 

k = 0 k=0 

= r2 r - (2' - 1) = (log 2 n)n - (n - 1) 

since n = 2' and hence r = log 2 n . Thus the number of binary comparisons in a merge 
sort is 0(n log 2 n). However, extra computer time is required to implement the initial 
subdivision process. This extra work also requires only 0(n log, n) steps. 

The 0(n log j n ) bound is also achieved by the tree-based sorting procedure called 
heap sort. A heap is a binary or almost-binary (some internal vertices may have just 
one child) tree with each internal vertex’s value being numerically greater than the 
values of its children. Figure 3.25 a shows a heap involving the numbers 0 through 9. 
The root of the heap will have the largest value in the set. If we remove the root of 
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Figure 3.25 




the heap (and place it at the end of the ordered list we are building), then we can 
reestablish a heap by making the larger child of the root the new root and recursively 
replacing each internal vertex moved up with the larger of its two children. The arrows 
in Figure 3.25a show how vertices move up when the root 9 is removed. The new heap 
is shown in Figure 3.25 b. Now we again remove the root of the heap (and make it the 
next-to-last item in the sorted list). Repeat this procedure until the heap is emptied 
and the sorted list complete. The one important missing step is creating the initial 
heap. This problem and further discussion of heap sort are left for the exercises. 



3.4 EXERCISES 


Summary of Exercises The first exercises are based on the sorting 
methods presented in this section. The remaining exercises present other sorting 
schemes. 


1. Apply merge sort to the following sequences: 

(a) 6,9, 5,0, 3, 1,8,4, 2,7 

(b) 15, 27, 4, -7, 9, 13, 8, 28, 12, 20, -80 

2. Show that at most n — 1 comparisons are needed to merge two sorted sublists 
into a single sorted list of n items. 

3. Use the result of Exercise 17 in Section 3.1 to show that the average number of 
binary comparisons required to sort n items is at least 0(n log 2 n). 

4. (a) Describe how to build an initial heap from an unordered list of n items (the 

initial heap should be a balanced tree). 

(b) Use your method in part (a) to make a heap for the lists in Exercise 1. 

(c) Apply heap sort to the heaps in part (b). 

5. Show that heap sort requires 0(n logi n) comparisons to sort n items (this includes 
the initial construction of a heap). 

6. Modify the following sorting methods to allow for repeated (two or more equal) 
items: 

(a) Bubble sort (b) Merge sort (c) Heap sort 
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7. Write a computer implementation of (use a recursive language such as PASCAL) : 

(a) Merge sort (b) Heap sort 

8. Consider the following sorting scheme, which we call tree sort. We build a 
“dictionary look-up” binary tree recursively. The first item in the list to be sorted 
is the root. The second item is a left or right child of the root, depending on 
whether it numerically precedes or follows the root. We continue to add each 
successive item as a leaf to this growing tree. For example, if a list begins 8, 4, 
11,6,..., the tree after four items would be 


8 



4 11 


6 

When all items have been incorporated into the tree, the sorted order is obtained 
by an inorder traversal of the tree (inorder traversals were defined in Section 3.2). 
Build the tree for the list in Exercise 1(a). 

9. Consider the following sorting scheme for the list A\, At, . . . , A„(n = 2' ). First 
do a sort (one comparison) of A, and A i+n / 2 ; call this the ith ordered pair, for 
i = 1, 2, . . . , n/2. Next do a merge sort of the ith and the (i +«/4)th ordered 
pairs; call this the ith 4-tuple. Next do a merge sort of the ith and the (i + n/8)th 
ordered 4-tuples. Continue until there is just one sorted list of all n items. 

(a) Apply this sorting method to the list 10, 12, 7, 0, 5, 8, 11, 15, 1, 6, 3, 9, 13, 
4, 2, 14. 

(b) How many comparisons does this sorting method require for a list of n 


numbers? 



X. 


3.5 SUMMARY AND REFERENCES 


This chapter examined a variety of search and data organization problems. The com- 
mon graph-theoretic tool for all these problems was trees. Section 3.1 presented 
basic properties and terminology of trees. Section 3.2 introduced spanning trees and 
demonstrated the uses of depth-first and breadth-first search. Section 3.3 gave tree- 
based branch-and-bound and heuristic approaches to a famous optimization problem, 
the traveling salesperson problem. Section 3.4 looked at the decision trees underlying 
sorting algorithms. 

The first paper implicitly using trees was Kirchhoff ’s 1 847 fundamental paper 
about electrical networks. Cayley was the first person to use the term tree in an 1857 
formula for counting ordered trees (Theorem 5 in Section 3.1). Searching methods 
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have been around for years (see Lucas [3]), but a systematic development of this 
subject came only in recent years with the advent of digital computers. There are 
several good computer science books about searching, sorting, and graph algorithms. 
See Kruse [1] or Sedgewick [4]. For a good survey of the Traveling Salesperson 
Problem, see Lawler et al. [2]. 

1. R. Kruse, Data Structures and Program Design in C++, Prentice-Hall, Upper 
Saddle River, NJ, 1998. 

2. E. Lawler, J. Lenstra, A. Rinnooy Kan, and D. Shmoys, The Traveling Salesman 
Problem, John Wiley, New York, 1985. 

3. E. Lucas, Recreations Mathematiques, Gauthier- Villars, Paris, 1891. 

4 . R. Sedgewick, Algorithms in C, 3rd. ed., Addison- Wesley, Reading, MA, 1997. 


CHAPTER 4 

NETWORK 

ALGORITHMS 



SHORTEST PATHS 


In this chapter we present algorithms for the solution of four important network 
optimization problems: shortest paths, minimum spanning trees, maximum flows, 
and the transportation problem. By a network we mean a graph with a non-negative 
integer k(e) assigned to each edge e. This integer will typically represent the “length” 
or “cost” of an edge, in units such as miles or dollars, or represent “capacity” of an 
edge, in units such as megawatts or gallons per minute. The optimization problems 
we shall discuss arise in hundreds of different guises in management science and 
system analysis settings. Thus good systematic procedures for their solution are 
essential. In the case of network flows, we shall see that the flow optimization 
algorithm can also be used to prove several well-known combinatorial theorems. 

We begin with an algorithm for a relatively simple problem, finding a shortest 
path in a network from point a to point z. We do not say the shortest path because, 
in general, there may be more than one shortest path from a to z. For the rest of this 
section, let us assume that all networks are undirected and connected. 

Let us immediately eliminate one possible shortest path algorithm: Determine 
the lengths of all paths from a to z, and choose a shortest one. The computer is fast, 
but not that fast — such enumeration is already infeasible for most networks with 100 
vertices. So when we find a shortest path, we must be able to prove it is shortest 
without explicitly comparing it with all other a-z paths. Although the problem is now 
starting to sound difficult, there is a straightforward algorithmic solution. 

The algorithm we present is due to Dijkstra. This algorithm gives shortest paths 
from a given vertex a to all other vertices. Let k(e) denote the length of edge e. Let the 
variable m be a “distance counter.” For increasing values of m, the algorithm labels 
vertices whose minimum distance from vertex a is m. The first label of a vertex x will 
be the previous vertex on the shortest path from a to x. The second label of x will be 
the length of the shortest path from a to x. 

Shortest Path Algorithm 

1. Set m — 1 and label vertex a with (— , 0) (the ” represents a blank). 

2. Check each edge e — ip, q ) from some labeled vertex p to some unlabeled vertex 

q. Suppose p’s labels are [r, dip)]. If dip) + k(e) = in, label q with ip, m). 
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3. If all vertices are not yet labeled, increment m by 1 and go to Step 2. Otherwise 
go to Step 4. If we are only interested in a shortest path to z, then we go to Step 
4 when z is labeled. 

4. For any vertex y, a shortest path from a to y has length d(y ), the second label of 
y. Such a path may be found by backtracking from y (using the first labels) as 
described below. 

Observe that instead of concentrating on the distances to specific vertices, this 
algorithm solves the questions: how far can we get in 1 unit, how far in 2 units, in 3 
units, . . . , in m units, . . . ? Formal verification of this algorithm requires an induction 
proof (based on the number of labeled vertices). The edges that make up the shortest 
paths from a to the other vertices can be shown to form a spanning tree (Exercise 11). 

The key idea is that to find a shortest path from a to any other vertex 
we must first find shortest paths from a to the “intervening” vertices. If P % = 
(si, S 2 , ■ ■ ■ , Sk) is a shortest path from to Sk, then Ft = Pk-\ + Cs>_i , .$>), 

where Pk-i = (.v i , sq , , Sk- 1 ) is a shortest path to ,s>_ i . Similarly, Pk-\ = Pk -2 + 
(Sk- 2 , Sk- i), and so on. 

To record a shortest path to Sk all we need to store (as the first part of a label 
in the above algorithm) is the name of the next-to-last vertex on Pt — namely, s*_i. 
Preceding Sk- | on the shortest path from a is Sk- 2 , the next-to-last vertex on !\- 1 . By 
continuing this backtracking process, we can recover all of Pk- 

The algorithm given above has two significant inefficiencies. First, if all sums 
d(p) + k(e) in Step 2 have values of at least in' > in, then the distance counter m 
should be increased immediately to m' . Second, one does not need to repeat the 
computation of the expressions d(p ) + k(e) in Step 2 every time m increases. Instead, 
each unlabeled vertex q can be given a “temporary” label ( p , d*(q)), where d*(q) 
equals the minimum of the d(p) + k(e), for those labeled vertices p with an edge 
e = (p, q ) to q. Thus, d*(q) represents the current shortest path length to q using 
labeled vertices. Now q s temporary label needs to be updated only when a newly 
labeled vertex p is adjacent to q. Details of this and other improvements in the form 
of Dijkstra’s algorithm given here can be found in Ahuja et al. [1]. 

Example 1: Shortest Path 

A newly married couple, upon finding that they are incompatible, want to find a 
shortest path from point N (Niagara Falls) to point R (Reno) in the road network 
shown in Figure 4.1. We apply the shortest path algorithm. First A is labeled (— , 0). 
For m = 1, no new labeling can be done [we check edges (A, b), (A, d), and 
(A,/)]. For m = 2, d(N) + k(N , b) — 0 + 2 = 2, and we label b with (A, 2). For m = 
3, 4, no new labeling can be done. For m = 5, d{b) + k(b, c) = 2 + 3 = 5. So we label 
c with (b, 5). We continue to obtain the labeling shown in Figure 4.1. Backtracking 
from R, we find the shortest path to be N-b-c-d-h-k-j-m-R with length 24. ■ 

If we want simultaneously to find the shortest distances between all pairs of 
vertices (without directly finding all the associated shortest paths), we can use the 
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following simple algorithm due to Floyd. Let matrix D have entry djj = oo (or a very 
large number) if there is no edge from the /th vertex to the /th vertex; otherwise dy 
is the length of the edge from x, , to xj . Then Floyd’s algorithm is most easily stated 
with the following deceptively simple computer program: 

FOR k <— 1 TO n DO 

FOR it-1 TO n DO 

FOR j <- 1 TO n DO 

IF d*+ djy< dy THEN dy <- <3*+ dy; 

When finished, dij will be the shortest distance from the /th vertex to the jth 
vertex. 



4.1 EXERCISES 


Summary of Exercises The first six exercises involve shortest path 
calculations. The remaining exercises discuss associated theory (fairly easy theory), 
and the last exercise asks for a program. 


1. Use the shortest path algorithm to find the shortest path between vertex c and 
vertex m in Figure 4.1. 

2. Find the shortest path between the following pairs of vertices in the network in 
Figure 4.2 in Section 4.2: 

(a) a and y (b) d and r (c) e and g 

3. The network below shows the paths to success from L (log cabin) to W (White 
House). The first number on an edge is the time (number of years) it takes to 
traverse the edge; the second is the number of enemies you make in taking that 
edge. Use the shortest path algorithm to answer the following: 

(a) Find the quickest path to success (from L to W ) . 

(b) Find the quickest path to success that avoids points c, g, and k. 

(c) Find the path to success (from L to W) that minimizes the total number of 
enemies you make. 


130 


Chapter 4 Network Algorithms 



(d) Alter the order in which edges are checked in Step 2 of the shortest path 
algorithm to get another minimum-enemies path. How many such minimum- 
enemies paths are there? 

4 . With reference to Exercise 3, let the roughness index R of a path to success be 
R = T + 2 E, where T is the time to get from L to W and E is the total number of 
enemies made. 

(a) Find the smoothest (least rough) path to success. 

(b) Find the smoothest path to success that includes edge (f i): this edge can be 
traversed in either direction. 

5. Ignore the numbers on the edges in Exercise 3 and use the shortest path algorithm 
to find the following shortest (fewest edges) paths: 

(a) Shortest path from /. to IV 

(b) Shortest path from L to W including vertex cl 

(c) Shortest path from L to W including both vertex e and vertex m 

6. Suppose that the edges in Exercise 3 are directed according to the alphabetical 
order of the endpoints, where L precedes a and W follows m [so edge (r~, s ) goes 
from r to s if r precedes s in the alphabet]. Find the quickest path from L to Win 
this directed network. 

7. Prove by induction on m that Dijkstra’s shortest path algorithm finds the shortest 
path from a to every other vertex in the network. 

8. Prove that Floyd’s algorithm finds the shortest path between all pairs of vertices. 

9. Make up an example to show that Dijkstra’s algorithm fails if negative edge 
lengths are allowed. 

10 . (a) Show that if the edges are properly ordered (and the edges are checked in this 

order in Step 2), the shortest path algorithm will produce any given shortest 
path from a to z when more than one shortest path exists. 

(b) Order the edges of the figure in Exercise 3 so that in Exercise 3(c), the 
“shortest” path found will be (L, a, c, d , /, g, k, m, W ). 

11 . Show that in the shortest path algorithm, the edges used in Step 2 to label new 
vertices form a spanning tree. 

12 . The transitive closure of a directed graph G is obtained by adding to G an edge 
(xf, x j ) for each nonadjacent pair x,- , Xj with a directed path from to x,- to xj . Let 
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d,j = 1 if (x{*, xj ) is an edge in G and = 0 otherwise. Replace the IF statement 
in Floyd’s algorithm with 


IF djk ■ dkj > dij THEN dy 1 


Show that this revised Floyd’s algorithm finds the transitive closure of G. 
13 . Write a computer program implementing Dijkstra’s shortest path algorithm 



4.2 MINIMUM SPANNING TREES 


A minimum spanning tree in a network is a spanning tree whose sum of edge lengths 
k(e ) is as small as possible. Minimum spanning trees arise in a variety of important 
commercial settings, such as finding a minimum-length fiber-optic network to link 
a given set of sites. This problem appears to be harder than finding a shortest path 
between two given vertices, but with the proper algorithm it is actually easier to solve 
by hand than the shortest path problem. The reason for the simple solution is that 
there are straightforward “greedy” algorithms that can build a minimum spanning 
tree by successively picking a shortest available edge. 

We present two greedy algorithms for building a minimum spanning tree. Let n 
denote the number of vertices in the network. 

Kruskal’s Algorithm 

Repeat the following step until the set T has n — 1 edges (initially T is empty): add to 
T the shortest edge that does not form a circuit with edges already in T. 

Prim’s Algorithm 

Repeat the following step until tree T has n — 1 edges: add to T the shortest edge 
between a vertex in T and a vertex not in T (initially pick any edge of shortest length). 

In both algorithms, when there is a tie for the shortest edge to be added, any of 
the tied edges may be chosen. A proof of the validity of Kruskal’s algorithm is given 
in Exercise 12. We prove the validity of Prim’s algorithm shortly. 

Example 1: Minimum Spanning Tree 

We seek a minimum spanning tree for the network in Figure 4.2. Both algorithms 
start with a shortest edge. There are three edges of length 1: (a,/), (/, q ), and (r, w). 
Suppose we pick (a,/). If we follow Prim’s algorithm, the next edge we would add 
is (a, b) of length 2, then (/, g ) of length 4, (g, /), then (/, q ), then (/, m), and so forth. 
The next-to-last addition would be either (m, n) or (o, t) both of length 5 [suppose we 
choose (m, «)], and either one would be followed by (n, o). The final tree is indicated 
with darkened lines in Figure 4.2. 

On the other hand, if we follow Kruskal’s algorithm, we first include all three 
edges of length 1: (a,/), (/, q), (r, w). Next we would add all the edges of length 2: 
(a, b), (e, f), (g, /), (h, /'), (Z, m), ( p , n), (s, x), (x, y). Next we would add almost all the 
edges of length 3: (c, h), id, e), (k, l), ( k , p), (q, v), (r, s), (v, w), but not (w, x) unless 
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(r, s ) were omitted [if both were present we would get a circuit containing these two 
edges together with edges (r, w) and ( s , *)]. Next we would add all the edges of length 
4 and finally either (m, ri), or ( o , t ) to obtain the same minimum spanning tree(s) 
produced by Prim’s algorithm. This similarity is no coincidence (see Exercise 14). ■ 
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The difficult part in the minimum spanning tree problem is proving the minimality 
of the trees produced by the two algorithms. 

Theorem 

Prim’s algorithm yields a minimum spanning tree. 


Proof 

For simplicity, assume that the edges all have different lengths. Let T' be a minimum 
spanning tree chosen to have as many edges as possible in common with the tree T * 
constructed by Prim’s algorithm. 

If T* f- T' , let e k — (a, b) be the first edge chosen by Prim’s algorithm that is not 
in T' . This means that the subtree T k _\, composed of the first k— 1 edges chosen by 
Prim’s algorithm, is part of the minimum tree T . In Figure 4.3, edges of T k -\ are in 
bold, efs edge is dashed, and the other edges of T' are drawn normally. Since e k is 
not in the minimum spanning tree T', there is a path, call it P, in T' that connects a 
to b. At least one of the edges of P must not be in T k _\ , for otherwise P U e k would 
form a circuit in the tree T k formed by the first k edges in Prim’s algorithm. Let e* be 
the first edge along P (starting from a) that is not in T k _\ (see Figure 4.3). Note that 
one end vertex of e* is in 7*— i- 

If e* is shorter than e k , then on the &;th iteration. Prim’s algorithm would have 
incorporated e*, not e k . If e* is greater than e k , we remove e* from T' and replace it 
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with <?£. The new T' is still a spanning tree (see Exercise 10 for details), but its length 
is shorter — this contradicts the minimality of the original T' . ♦ 


4.2 EXERCISES 



' Summary of Exercises The first five exercises involve building 
minimum spanning trees. Exercises 6-15 are variations and theory questions about 
the two minimum spanning tree algorithms. 

1. The network below shows the paths to success from L (log cabin) to W (White 
House). The first number is the cost of building a freeway along the edge and the 
second is the number of trees (in thousands) that would have to be cut down. 



(a) Use Kruskal’s algorithm to find a minimum-cost set of freeways connecting 
all the vertices together. 

(b) Court action by conservationists rules out use of edges (c, e), (d, /), and 
( k , W ). Now find the minimum-cost set of freeways. 

(c) Find two nonadjacent vertices such that the tree in part (a) does not contain 
the cheapest path between them. 

(d) Use Prim’s algorithm to find a set of connecting freeways that minimizes the 
number of trees cut down. 

2. With reference to Exercise 1 , suppose that the set of freeways must include city c 
or city d (but not necessarily both) and all the other cities. Find the minimum-cost 
set of freeways. 

3. Find a minimum spanning tree for the network in Figure 4. 1 using 

(a) Prim’s algorithm (b) Kruskal’s algorithm 

4. With reference to Exercise 1, suppose that the governor’s summer home is along 
edge (/, ;)■ Find a minimum-cost set of freeways such that (/, i ) is in that set. 

5. Find a maximum spanning tree (whose sum of edge lengths is maximum) for the 
network in Figure 4.2. 

6. If each edge has a different cost, show that the minimum spanning tree is unique. 

7. Modify Kruskal’s algorithm so that it finds a minimum spanning tree that contains 
a prescribed edge. Prove that your modification works. 

8. Modify Prim’s algorithm so that it finds a maximum spanning tree. 
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9. In the proof of the Theorem, show that 

(a) 6k = (a, b) has one end vertex (a or h ) in the tree 7/ ( _i . 

(b) e* has one of its end vertices in the tree 7) _ i . 

10 . Show that the new T (mentioned in the last sentence of the proof of the 
Theorem) with e* replaced by e/ t is a tree — that is, that T is connected and 
circuit-free. 

11 . Let The the spanning tree found by Prim’s algorithm in an undirected, connected 
network N. 

(a) Prove that T contains all edges of shortest length in N unless such edges 
include a circuit. 

(b) Prove that if e* = (a, b) is any edge of N not in T and if P is the unique path 
in T from a to b, then for each edge e in P, k(e) < k(e*). 

(c) Prove that part (b) characterizes a minimum spanning tree — that is, that any 
spanning tree T (not just those formed by Prim’s algorithm) is a minimum 
spanning tree if and only if part (b) is always true for T. 

12 . Prove that Kruskal’s algorithm gives a minimum spanning tree. 

13 . Construct an undirected, connected network with eight vertices and 15 edges 
that has a minimum spanning tree containing the shortest path between every 
pair of vertices. 

14 . Suppose that the edges of the undirected, connected network N are ordered and 
that in both Prim’s and Kruskal’s algorithms, when there is a tie for the next 
edge to be added, the smaller indexed edge is chosen. 

(a) Prove that the edges can be ordered so that Prim’s algorithm will yield any 
given minimum spanning tree. 

(b) Prove that the edges can be ordered so that Kruskal’s algorithm will yield 
any given minimum spanning tree. 

(c) Prove that with ordered edges, both algorithms give the same tree. 

15 . Given an undirected, connected /z- vertex graph G with lengths assigned to each 
edge, we form a graph G n whose vertices correspond to minimum spanning trees 
of G with two vertices v\, v 2 adjacent if the corresponding minimum spanning 
trees 7j , T 2 differ by one edge — that is, 7j = T 2 —e' + e" (for some e\ e"). 

(a) Produce an 8-vertex network G such that G^ is a chordless 4-circuit. 

(b) Prove that if 7) and T 2 are minimum spanning trees which differ by k edges, 
that is |7\ fl T 2 \ = n — k, then in G v there is a path of length k between the 
corresponding vertices. 

16 . Write a computer program to implement (as efficiently as possible) the following: 

(a) Kruskal’s algorithm (b) Prim’s algorithm 
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4.3 NETWORK FLOWS 

In this section we interpret the integer k(e) associated with edge e in a network as a 
capacity. We seek to maximize a “flow” from vertex a to vertex z such that the flow 
in each edge does not exceed that edge’s capacity. Many transport problems are of 
this general form — for example, maximizing the flow of oil from Houston to New 
York through a large pipeline network (here the capacity of an edge represents the 
capacity in barrels per minute of a section of pipeline), or maximizing the number of 
telephone calls possible between New York and Los Angeles through the cables in a 
telephone network. It is convenient to assume initially that all networks are directed. 

We define an a-z flow /(e) in a directed network N to be an integer-valued 
function /(e) defined on each edge e — /(e) is the flow in e — together with a source 
vertex a and a sink vertex z satisfying the following conditions. In(x) and Out(x) 
denote the sets of edges directed into and out from vertex x, respectively. 

(a) 0 < /(e) < k(e) 

(b) Forx^aorz, /(e) = /(e). 

e€ln(jt) eeOut(x) 

To simplify our analysis, we assume that the flow goes from a to z, never in the 
reverse direction: 

(c) /(e) = 0 if e e In(a) ore e Out(z). 

A sample flow is shown in Figure 4.5; the capacity and flow in each edge e are 
written beside the edge; k(e),f(e). The assumption of integer capacities and flows is 
not restrictive — that is, the units we count could be thousandths of an ounce rather 
than barrels. The requirement that there be a single supply vertex, the source a, and a 
single demand vertex, the sink z, also turns out not to be a restriction. The reason is 
that we can recast network problems with multiple sources and multiple sinks into a 
single-source, single-sink network, as illustrated in the following example. 

Example 1: Flow Networks with Supplies and Demands 

Consider the network of solid edges in Figure 4.4 with supplies and demands. Vertex 
b can supply up to 60 units of flow, and vertices c and d can each supply 40 units. 
Vertices h, i, and j have flow demands of 50, 40, and 40 units, respectively. Can we 


b 30 e 20 h 
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meet all the demands? The sources could be oil refineries and the sinks oil-truck 
distribution centers, or the sources factories and the sinks warehouses. 

We model this network problem with a standard one-source, one-sink network 
as follows. Make b, c, and d regular nonsource vertices. Add a new source vertex a 
and edges (a"*, b), {a*, c), and (a", d ) having capacities 60, 40, and 40, respectively. 
This construction simulates the role of the capacitated sources b, c, d. Next make h, 
i, and j regular nonsink vertices and add a new sink vertex z and edges (hT, z), (i~, z), 
and (j~, z), having capacities 50, 40, and 40, respectively. 

A flow satisfying the original demands is equivalent to a flow in the new network 
that saturates the edges coming into z, that is, a flow of value 130. ■ 

Let ( P , P) denote the set of all edges (x*. y) with vertex x e P and y e P) 
(where P denotes the complement of P). We call such a set ( P . P ) a cut. The cut 
({o, b, c}, {d, e, j}) in Figure 4.5 consists of the edges (ZT, d), (b ", e), (c"*, e); the edge 
(cT, c ) is not in the cut because it goes from P to P. This cut is represented in Figure 
4.5 by a dashed line that separates the vertices in P = {a, b. c} from the vertices in 
P; the edges in the cut are the edges crossing the dashed line from left to right. We 
call ( P . P) an a-z cut if a e P and z e P . The a-z cuts in a network are important, 
because all flow from a to z must cross each a-z cut. The combined capacity of the 
edges in any a-z cut is thus an upper bound on how much flow can get from a to z. 

Let /(e) be an a-z flow in the network N and let P be a subset of vertices in N not 
containing a or z ■ Summing together the conservation-of-flow equations in condition 
( b ) for each x e P, we obtain 



xeP eeOut(jc) 


xeP eeln(x) 


Certain /(e) terms occur on both sides of the preceding equality: namely, edges 
from one vertex in P to another vertex in P. After eliminating such /(e) from both 
sides, the left side becomes X^ € (p />> /( e ) an d the right side ^/ ;g(7 , P) /(e). Thus 
we have 


(b') For each vertex subset P not containing a or z. 


E f (e) = E /(*> 


ee(P,P) ee(P,P) 


That is, the flow into P equals the flow out of P. The following intuitive result is 
readily verified with (b'). 
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Figure 4.5 
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Theorem 1 

For any a-z flow f(e ) in a network N, the flow out of a equals the flow into z. 
Proof 

Assume temporarily that N contains no edge ( cT , z). Let P be all vertices in N except 
a and z. So P = {a, z}. 

Remember that the only flow into P from {a, z} must be from a, since condition 
(c) forbids flow from z. Similarly by condition (c), all flow out of P must go to z. Then 
(P, P) consists of edges going into z (from P) and (P, P) consists of edges going out 
of a (to P). Thus 

flow out of a = f(e) by = ' /(e) = flow into z 

ee(P.P) ee(P.P) 

The flow equality still holds if there is flow in an edge (cT, z). ♦ 

The value of the a-z flow /(e), denoted |/|, equals the sum of the flow in edges 
coming out of a, or equivalently by Theorem 1 , the flow into z. Let us consider the 
question of how large |/| can be. One upper bound is the sum of the capacities of the 
edges leaving a, since 

1/1= E f(e) < k ^ 

eeOut (a) eeOut(tf) 

Similarly, the sum of the capacities of the edges entering z is an upper bound for 
| / 1 . Intuitively, |/| is bounded by the sum of the capacities of any set of edges that 
cut all flow from a to z — that is, the edges in an a-z cut. 

We define the capacity k(P , P) of the cut (P, P) to be 

k(P,P)= J2 k <» 

ee( P.P) 

The capacity of the a-z cut (P, P), where P = {a, b, c), in Figure 4.5 is 13. This tells 
us that no a-z flow in the network in Figure 4.5 can have a value greater than 13. This 
motivates the following theorem. 

Theorem 2 

For any a-z flow/and any a-z cut (P, P) in a network N, \f\ < k(P, P). 

Proof 

Informally, |/| should equal the total flow from P to P | which is bounded by k{ P, P)], 
but we cannot prove an equality of this type using condition (b' ) without first modifying 
N. We cannot use (b') in N because condition (b') requires that P not contain the 
source a. 

Expand the network Aby adding a new source vertex a' with an edge e' = (a'/ a) 
of immense capacity. Assign a flow value of |/| to e', yielding a valid a'-z flow in 
the expanded network (see Figure 4.6). In effect, the old source a now gets its flow 
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Figure 4. 6 

from the “super source” a'. Note that a 1 will be part of P. Observe that new source a' 
is playing a role similar to the source a we added to the multiple-source network in 
Example 1. 

In the new network, we can apply condition (b') to P. It says that the flow into P, 
which is at least |/| (other flow could come into P along edges from P), equals the 
flow out of P. Thus 

l/l< E f^ y = E / (e) - E k(e) = k(P,P)+ (*) 

ee(P,P) ee(P,P) ee(P,P ) 



Since Theorem 2 says that the value of an a-z flow can never exceed the capacity 
of any a-z cut, it follows that if the value of some a-z flow /* equals the capacity of 
some a-z cut, then /* has to be a flow of the maximum possible value. We shall show 
that for any flow network, we can always construct an a-z flow whose value equals 
the capacity of some a-z cut. As just noted, such an a-z flow will be guaranteed to be 
a flow of maximum value. 

To see what properties a flow must have to make the inequality in Theorem 2 an 
equality, we need to look carefully at the two inequalities in expression (*) that was 
used to prove Theorem 2. 

Corollary 2a 

For any a-z flow/and any a-z cut (P, P) in a network N, |/| —k(P, P) if and only if 

(i) For each edge e e (P, P), f(e) = 0. 

(ii) For each edge e e ( P, P), /(e) = k(e). 

Further, when |/| =k(P, P), f is a maximum flow and (P, P) is an a-z cut of 
minimum capacity. 

Proof 

Consider the two inequalities in (*) of the preceding proof. The first inequality is an 
equality — the flow from P into P (in the expanded network) equals | /| — if condition 
(i) holds; otherwise the flow into P is greater than |/|. The second inequality is an 
equality — the flow out of P equals k(P, P) — if condition (ii) holds; otherwise it is 
less. Thus equality holds in (*) if and only if conditions (i) and (ii) are both true. The 
last sentence in the corollary follows directly from Theorem 2. ♦ 
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We have developed all the concepts needed to present our flow maximizing 
algorithm. We first, discuss an intuitive but faulty technique that can sometimes be 
used as a shortcut in place of the correct algorithm. After the fault in the shortcut is 
exposed, the correct algorithm can be more easily understood and appreciated. 

All normal flows can be decomposed into a sum of unit-flow paths from a to z, for 
short, a-z unit flows (abnormal flows that cannot be so decomposed are discussed in 
Exercise 23). For example, in a telephone network, the flow from New York to Los An- 
geles can be decomposed into paths of individual telephone calls. Similarly, flow of oil 
in a pipeline network can be decomposed into the paths of each individual petroleum 
molecule. Formally, an a-z unit flow//, along a-z path L is defined as fide) = 1 
if e is in L and = 0 if e is not in L. 

This suggests a way to build a maximum flow. We build up the flow as much 
as possible by successively adding a-z unit flows together, always being sure not 
to exceed any edge’s capacity. An additional unit flow can use only unsaturated 
edges, edges where the present flow does not equal the capacity. So we must build 
paths consisting of unsaturated edges. We define the slack s(e) of edge e in flow 
/by 


s(e) = k(e) - /(e) 

If ^ is the minimum slack among edges in the a-z unit flow//, then we can put an 
additional flow along L of sfi = f L + f L + f L H — • + //, (,? times). 

If /i, fi, ■ ■ ■ , f m are a-z unit flows, then f = f x + f 2 - 1 b /„, will satisfy 

conditions (b) and (c) in the definition of a flow (given at the beginning of this section) 
since f\ , f 2 , . . . , f m satisfy these conditions. If in addition, < p satisfies condition (a) — 
/(e) < k(e), for all e — then/is a valid flow. 


Example 2: Building a Flow with Flow Paths 

Let us use the method just outlined to build a maximum a-z flow for the network in 
Figure 4.7a. Note, as an upper bound, that the value of a flow cannot exceed 10, the 
sum of the capacities of edges going out of a. 

We start with no flow — that is, /(e) = 0, for all e. Now we find some path from 
a to z, for example, the a-z path L\ = a-b-d-z. The minimum slack on L\ is 3 (at the 
start, the slack of each edge is just its capacity). So to our initial zero flow, we add 
the flow 3/z.j. All edges except (If. d ) are still unsaturated. 

Suppose that we next find the a-z path Li — a-c-e-z, also with minimum slack 
3. Our current flow is 3//, + 3//, 2 , as shown in Figure 4.7/?. The path L 3 = a-b-e-z 
with minimum slack 2 can be used to get the augmenting flow 2 /y v Figure 4.7c 
shows the remaining unsaturated edges with their slacks. 

The only a-z path in Figure 4.7c is L 4 — a-c-d-z with minimum slack 1 . After 
adding the flow // 4 (see Figure 4.7 d), we can get no further than from a to c. Observe 
that we have saturated the edges in the a-z cut (P, P), where P = [a, c}. The value 
of the final flow, 9, equals the capacity k(P, P) of this cut, and so by Corollary 2a 
the flow must be maximum. ■ 
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Example 3: Faulty Flow Building 

Suppose the network in Figure 4.7a is redrawn as in Figure 4.1 e, with the positions 
of cl and e switched. 

Let us again choose augmenting a-z flow paths across the top and bottom of the 
network, now having sizes 5 and 1, respectively. Since edges ( cT , b) and (c r , cl), are 
saturated by these flow paths, the only possible a-z path along unsaturated edges is 
= a-c-e-z with minimum slack 1 [the minimum occurring in edge (e, ")). After 
adding the a-z unit flow fi 5 , we get the flow /o shown in Figure 4.7e; the network 
is redrawn in Figure 4.7f to make the cut ( P (l , P o) clearer. The cut ( Pq , P o), where 
Po = {a, c, ej, is saturated, and so no more augmenting a-z unit flows exist. Yet 
|/o| —1 and k( P q, P 0 ) = 12. Remember that a flow of size 9 was obtained for this 
same network in Example 2 ! What has happened? 

We now see that an arbitrary sequence of augmenting a-z unit flows need not 
inevitably yield a maximum flow. We are also faced with a flow /o and a saturated 
a-z cut (P 0 , P o) such that |/o| < k(Po, Po). Corollary 2a implies that there must be 
some flow in an edge e e (Po, Po)- Looking at Figure 4.7/, we see that the flow path 
L' = a-b-e-z crosses the a-z cut (Po, Po) backward (from P to Po) on the edge 
Qf, e ); or, equivalently, L' crosses the cut forward twice, on edge ( cT , b) and again 
on (e*, z). Thus the 5 units of flow along Ll use up 10 units of capacity in the cut, 
whence k(Po, Po) is 5 units greater than |/o|. 

The reason that the sequence of augmenting flow paths in this example did not 
lead to a maximum flow can be explained intuitively as follows. By sending 5 units 
of flow along L! (see Figure 4.7e), we have routed all the flow passing through h on 
to e and none of it to cl. Then only 1 unit of flow passing through c can be routed on 
to e and then along edge (cT, z). But much of the flow through c must go to e, since 
the capacity of (c~, cl) is only 1. In sum, the initial 5-unit flow along a-b-e-z was a 
“mistake” because some of the capacity in edge (cT, z) should have been “reserved” 
for flow from c. m 

How can we avoid or correct such mistakes? If we understood where the mistakes 
were made, we could change some of the flow paths and try a new sequence of 
augmenting flow paths. However, we are likely to make other mistakes in subsequent 
constructions. Indeed, there may be certain networks in which it is impossible not to 
make such a mistake, no matter what sequence of flow paths is used. In terms of cuts, 
we may always end with a saturated a-z cut that one of our flow paths crosses twice. 

Fortunately, there is a procedure to correct “mistakes” and thereby further increase 
the flow. The method will not look for edges that must be “reserved” for later flow 
paths, as suggested above (that is too hard a problem). Rather, it looks for flow that is 
going the wrong way (backward) across an a-z cut. Then it finds a way to decrease 
the backward flow without changing the forward flow across the cut, the result being 
more total flow across the cut (and through the network). The following example 
presents the idea behind this procedure. 

Example 3: (continued) 

The edge (IF, e) contains 5 units of flow going backward across the saturated cut 
(Po, Pq), where Pq = [a, c, ej. By how much can the flow in (JF, e) be reduced? 
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Condition (b) of a flow — flow in equals flow out, at each vertex — requires that 
a reduction of the flow into e from b must be compensated by an increase to e from 
elsewhere in Pq (if the compensating flow comes from P o we would have a new 
backward flow). Such an increase must in the end come from a. Thus, we need an 
a-e flow path in Pq. The only such path is K\ = a-c-e. Similarly, a reduction of the 
flow out of b to e must be compensated by an increase in flow out of b to somewhere 
else in Pq. So we need a b-z flow path in P 0 . The only such path is K 2 = b-d—z. 

The minimum slack along K\ is 2 and the minimum slack along K 2 is 3. Then 
we can decrease the flow in (ZT, e) by 2 while increasing the flow in K\ and K 2 by 2. 
Figure 4.1 g shows the resulting maximum flow. Note that this new maximum flow is 
different from the maximum flow in Figure 4.1 d, although both saturate the a-z cut 
({a, c}, {b, d, e, z}). ■ 

A chain in a directed graph is a sequence of edges that forms a path when the 
direction of the edges is ignored. A unit-flow chain from a to z along the a-z chain 
K is a “flow”/f with a value of 1 in each edge of K forwardly directed, a value of — 1 
in each edge of K backwardly directed, and a value of 0 elsewhere. Note that fk is 
not really a flow because it can assume a negative value on some edges. However, if 
/is a flow that already has positive values in each backwardly directed edge of K and 
has slack in each forwardly directed edge of K, then / + f K is a valid flow. The flow 
correction made in the continuation of Example 3 consisted of a (2-unit) a-z flow 
chain along chain K = a-c-e-b-d-z. 

When no backwardly directed edges occur in a flow chain, then it is simply a flow 
path. We shall see that any sequence of augmenting a-z flow chains can be extended 
to a maximum flow. Flow chains are the appropriate generalization of flow paths that 
both build additional flow and simultaneously correct possible “mistakes.” 

We now present a flow chain algorithm to increase the value of a flow. The 
algorithm is designed so that if it fails to obtain an augmenting a-z flow chain, it will 
produce a saturated a-z cut whose capacity equals the value | / 1 of the current flow /. 
Thus by Corollary 2a, /would be maximum. 

The algorithm recursively tries to build augmenting flow chains from a to all 
vertices in a manner reminiscent of the shortest path algorithm in Section 4. 1 . The 
algorithm assigns two labels to a vertex q: ( p :t , A (q)), where p is the previous vertex 
on a flow chain from a to q, and A (q) is the amount of additional flow that can be 
sent from a to q. That is, A (q) is the minimum slack among the edges of the chain 
from a to q. On a backwardly directed edge e, the slack is the amount of flow that can 
be removed, namely /(e). A + superscript on p means flow is being added to edge 
(/?"*, q); a — superscript means flow is being subtracted from edge (q~, p). 

Augmenting Flow Algorithm 

1. Give vertex a the labels (— , oo). 

2. Call the vertex being scanned p with second label A (p). Initially, p — a. 

(a) Check each incoming edge e = Uf. p). If /(e) > 0 and q is unlabeled, then 
label q with [p~ , A(</)], where A (q) = min[A(/?), /(e)]. 
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(b) Check each outgoing edge e = (p', q). If s(e) = k(e ) — f(e) > 0 and q is 
unlabeled, then label q with [p + , A(g)], where A (q) = min[A(/?), s(e)]. 

3. If z has been labeled, go to Step 4. Otherwise choose another labeled vertex to 
be scanned (which was not previously scanned) and go to Step 2. If there are no 
more labeled vertices to scan, let P be the set of labeled vertices, and now (P, P) 
is a saturated a-z cut. Moreover, \f\ = k(P , P), and thus/is maximum. 

4. Find an a-z chain K of slack edges by backtracking from z as in the shortest 
path algorithm. Then an a-z flow chain fa along K of A (z) units is the desired 
augmenting flow. Increase the flow in the edges of K by A(z) units (decrease flow 
if edge is backward directed in K). 

Like the shortest path algorithm, the flow algorithm extends partial-flow chains 
from currently labeled vertices to adjacent unlabeled vertices and the edges used to 
label vertices form a spanning tree. Before we prove that repeated application of our 
algorithm always leads to a maximum flow, let us give some examples. 

Example 3: (continued) 

Let us apply our augmenting flow algorithm to the flow in Figure 4.1 e. 

Vertex a is labeled (— , oo) by Step 1 of the algorithm. Next we apply Step 2b 
at a (Step 2a does not apply at a since the definition of a flow does not allow flow 
into the source a). The edge (cT, b ) is saturated, but (a*, c) has slack 5 — 2 = 3. So 
we label c ( a + , 3). At c, Step 2a finds no incoming flow from an unlabeled vertex, 
but Step 2b finds slack in edge (c~*, e) going to unlabeled vertex e. We label e (c + , 2) 
[2 is the minimum of A(c), the extra flow we can get to c, and the slack in edge (c r , e)]. 
At e, Step 2a finds a positive flow entering on edge (7; 7 e) from unlabeled vertex b. 
We label b (el 2). From b, we label d ( b + , 2), and from d we label z ( d + , 2). Sink z is 
now labeled and so Step 4 tells us we can get A(z) = 2 more units of flow from a to z. 
Backtracking with the labels, the flow chain K (in backwards order) is z-d-b-e-c-a. 
The new flow /o + 2fx is shown in Figure 4.7/. ■ 

Example 4: Using Augmenting Flow Algorithm 

Consider the network shown in Figure 4.8a. We seek a maximum flow from a to z. 

If a maximum flow were being found by a computer, it would have to start with a 
zero flow. When solving a flow problem by hand, we can speed the process by starting 
with a (nonzero) flow obtained by inspection (in small networks we can often obtain 
a maximum flow by inspection). Let the initial flow be / = 4 +4 fx 2 +5 fx 3 , 
where A) = a-b-e-z, AS = a-c-d-f-z, and // = a-d-f-z. See Figure 4.8a. The 
reader may see that we sent too much flow from c to d. Some of it should have gone 
directly from c to/. 

We now apply the labeling algorithm to the flow /. Label vertex a (— , oo). 
Scanning edges at a, there cannot be incoming edges to the source with flow [by 
part (b) of the definition of a flow], but there are three outgoing edges: edge (a", b ) 
has slack s(a b) — 2 > 0 and b is unlabeled, so we label b (a + , 2), where 2 is the 
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b(a+, 2) 7,4 e {b + , 2) 
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a (-, •) 



df.a+,2) 9,9 /(c + , 2) 

(a) 
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6(a+, 1) 7,5 e(6+, 1) 


a (-, •) 



4(c+, 1) 9, 9 f 

(c) 


Figure 4.8 

minimum of A (a) (— oo) and s(a"', /?); edge (a"*, c) has no slack; edge (a - , 4) has slack 
s(a~, d) — 2 and d is unlabeled, so we label it (a + , 2). 

Next we scan b (we will scan vertices in the order that they were labeled). There 
are two incoming edges at b\ Edge ( a / b) has f (cC . b) — 4 > 0 but a is already labeled; 
edge (c", b) has no flow. There is one outgoing edge at b: edge (IF, e) has slack 
sQf, e) = 3 and e is unlabeled, so we label e ( b + , 2), where 2 is min[A(b), s(fT, e)]. 

Next we scan at d. There are two incoming edges at d: edge (a"*, d) comes from 
a labeled vertex; edge (t r , d) has/(c/ d) — 4 and c is unlabeled, so using Step 2a we 
label c (d~ 2), where 2 = min[ A(d), f(c r , c/)]. There is one outgoing edge at d\ edge 
(cT, f) is saturated. 

No labeling can be done from e. At c we label/with (c + , 2). At/we label z with 


(/+, 2 ). 


Since z is now labeled, the labeling procedure terminates. We can send 2 [= A(z)] 
units in the augmenting a—z flow chain /^ 4 , where K 4 = a-d—c—f—z, found by the 
backtracking procedure. Recall that since edge (c r . d) is backwardly directed in K 4 . 
the flow chain /^ 4 subtracts 2 units from the current flow in edge (c", d). The new 
flow f' — f + 2/jf 4 is shown in Figure 4.8b. 

We eliminate all the current labels and restart the labeling algorithm from scratch. 
Label vertex a(— , 00 ). From a we label b with (a + , 2). At b we label e with (b + , 2). 
At e we label c with (e + , 2). At c we label d with (c + , 2) and /with (c + , 1). At d 
we can make no new labels. At /we label z with (/+, 1). The augmenting a—z flow 
chain is with K 4 = a-b—e-c—f—z, using forward edge (e~, c). Our new flow is 
f" = f + 2 f Kt + f Ks , shown in Figure 4.8c. 
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The reader may have observed that the flow f" is maximum. The incoming edges 
at z are now saturated. Let us again apply the augmenting flow algorithm to flow f". 
At a we label b; at b we label <?; at e we label c; and at c we label d. No more vertices 
can be labeled. Let P be the set of labeled vertices. Then ( P , P ) is the saturated a-z 
cut specified by the algorithm with \f"\ =16 = k( P . P). ■ 

In applying our algorithm, we must always check incoming edges in Step 2a 
for the possibility of minus labeling of vertices, even though this labeling is very 
infrequent. Recall that the minus labeling corresponds to correcting a mistaken flow 
assignment. A permissible shortcut would be to use only positive labeling (as in the 
faulty procedure discussed earlier) until no new flow paths can be found, and then 
apply the full algorithm to hunt for “mistakes.” The use of this shortcut serves to 
increase the importance of having a rigorous proof that, when repeatedly applied to 
any given flow, our algorithm will yield a maximum flow. 

Theorem 3 

For any given a-z flow/, a finite number of applications of the augmenting flow algo- 
rithm yields a maximum flow. Moreover, if P is the set of vertices labeled during the 
final (unsuccessful) application of the algorithm, then (P, P ) is a minimum a-z cut. 

Proof 

There are two main parts to the proof. First, if / is the current flow and f k is the 
augmenting a-z unit flow chain (along chain K) found by the algorithm with m = 
A (z), then we must show that the new flow / + mf K is indeed a legal flow in the 
network. Both/ and // satisfy flow conditions (b) and (c) and are integer- valued. 
Hence / + mfK also satisfies (b) and (c) and is integer- valued. The labeling algorithm 
is designed so that m = A(z) is the minimum slack (of the appropriate kind) along 
chain K and hence / + mf K satisfies flow condition (a): 0 < /(e) + mf K (e) < k{e). 

Since m is a positive integer, each new flow is larger by an integral amount. The 
capacities and the number of edges are finite, and so the algorithm must eventually 
halt — fail to label z. Let P be the set of labeled vertices when the algorithm halts. 
Clearly (P, P) is an a-z cut, since a is labeled and z. is not. Observe that there cannot 
be an unsaturated edge from a labeled vertex p to an unlabeled vertex q, or else at 
p we could have labeled q in Step 2b. Similarly, there cannot be a flow in an edge 
from an unlabeled vertex q to a labeled vertex p, or else again at p we could have 
labeled q in Step 2a. Thus, both conditions of Corollary 2a are satisfied. Hence the 
value of the final flow equals k(P, P ) and is maximum. Also (P, P) is a minimum 
a-z cut. ♦ 

Corollary 3a Max Flow-Min Cut Theorem 

In any directed flow network, the value of a maximum a-z flow is equal to the capacity 
of a minimum a-z cut. 

Let us now indicate how flows in a directed network can be used to model a large 
variety of extensions in directed and undirected networks. In Example 1, we showed 
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Figure 4.9 

how to model a problem with several supplies and demands by a network with one 
source and one sink. 

Example 5: Undirected Networks 

Suppose the undirected network in Figure 4.9 represents a network of telephone lines 
(the capacity of an edge is the number of calls the line can handle). We wish to know 
the maximum number of calls that the network can simultaneously carry between 
locations a and z. That is, we seek the value of a maximum flow in this network. 

To make the network directed, we can replace each undirected edge (x, y) by the 
two edges (jc - *, y) and (y r , x ), each with the same capacity as (x, y). An equivalent 
approach is to allow a directed flow in undirected edges. If e — (x, y), /(e) would be 
a number with an “arrow” indicating whether the flow goes from x to y or from y to x. 
Step 2 of the flow algorithm is modified as follows: when checking edges at a labeled 
vertex p, edges with a flow directed inward are treated like incoming edges and edges 
with no flow or flow away from p are treated like outgoing edges. ■ 

Example 6: Edge-Disjoint Paths in a Graph 

We are going to send messengers from a to z in the graph shown in Figure 4.10. 
Because certain edges (roads) may be blocked, we require each messenger to use 
different edges. How many messengers can be sent? That is, we want to know the 
number of edge-disjoint paths. 

We convert this path problem into a network flow problem by assigning unit 
capacities to each edge. One could think of the flow as “flow messengers,” and the 
unit capacities mean that at most one messenger can use any edge. The number of 
edge-disjoint paths (number of messengers) is thus equal to the value of a maximum 
flow in this undirected network. (See Example 5 for flows in undirected networks.) 

Observe that we have implicitly shown that a maximum a-z flow problem for a 
unit-capacity network is equivalent to finding the maximum number of edge-disjoint 
a-z paths in the associated graph (where edge capacities are ignored). This equiva- 
lence can be extended using multigraphs to all networks by replacing each ^-capacity 




Z 


Figure 4.10 
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edge ( x , y) with k multiple unit-capacity edges and now proceeding with the above 
conversion. ■ 


Example 7: Dynamic Network Flows 

We want to know how many autos can be shipped from location a to location z in four 
days through the network in Figure 4.1 1 a. We assume that each edge (x7 y) is the 
route of a train that leaves location x daily for a nonstop run to location y. The first 
number associated with an edge is the capacity of the trains (number of autos). The 
second number is the number of days the trip takes. Autos may be left temporarily at 
any location in the network. 

We turn this dynamic problem into a static a-z flow problem by adding the 
dimension of time to our network: each vertex x is replaced by five vertices x 0 , X| , 
X 2 , X 3 , X 4 , where the subscript refers to the zth day in the four-day shipping time 
(the starting day is day 0). For each original edge {xT, y), which takes k days to 
traverse, we make edges (xcf, yk), 0 *i"*, yk+ 1 ), . . . , (.* 4 _a~, .V 4 ), each with the same 
capacity as (x*, y). For each vertex x, we make four edges of the form (x~, x, + i) 
with very large (in effect, infinite) capacity; these edges correspond to the provision 
that permits autos to be left temporarily at any vertex from one day to the next. See 
Figure 4.1 1 b. 

A maximum flow in the new network gives the maximum dynamic flow in the 
original network. Note that for vertices other than a and z, the range of the subscripts 
of usable vertices will be at most 1 to 3. Nondaily trains could easily be incorporated 
into the model. ■ 
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It is hoped that the preceding examples have impressed the reader with the ver- 
satility of our basic static a—z flow model. More examples are to be found in the 
exercises. 



4.3 EXERCISES 


Summary of Exercises Exercises 1-17 mimic or extend the flow 
computations and models in Examples 3-7. Exercises 18-38 develop the theory of 
network flows (later exercises are very challenging). Exercises 39-42 present pro- 
gramming projects. 

In the following problems, unless directed otherwise, the reader should route 
most of the flow by inspection and then, when a near-optimal flow is obtained, use 
the augmenting flow algorithm to get further flow and afterwards a minimum cut. 


1. Apply the augmenting flow algorithm to the flow in Figure 4.5. 

2. Find a maximum a-z flow and minimum capacity a-z cut in the following 
networks: 



3. Find a maximum flow from a to z in the network of Figure 4.9 (using the 
associated directed network). Also give a minimum capacity a-z cut. 

4. Treat the first number assigned to each edge in Exercise 3 of Section 4.1 as a 
capacity and let the edges be directed by the alphabetical order of their endpoints 
with L preceding a and W following n [e.g., edge if. i ) goes from /to /]. 

(a) Find a maximum L-W flow and a minimum L-W cut in this network. 

(b) By inspection, find a different maximum L-W flow. 

(c) Make a misrouted flow that yields a saturated L-W cut whose capacity is 
greater than the flow (similar to the situation in Figure 4.7e). Now apply the 
algorithm. 
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(d) In addition, let the second number of each edge be a lower bound on the flow 
in that edge. Try to find an L-W flow satisfying both constraints. 

5. Delete vertices p through y in Figure 4.2 in Section 4.2 and treat the remaining 
edge numbers as capacities (edges are still undirected). Use the undirected version 
of the flow algorithm suggested in Example 5. 

(a) Find a maximum flow from b to j and a minimum b-j cut. 

(b) Build a b-j flow that saturates edge ( k , /) (in either direction) and then apply 
the algorithm. 

(c) Treat the edge numbers as lower bounds and modify the algorithm to find a 
maximum b-j flow, using the whole network (a through y). Remember that 
there is no flow into b or out of j. 

6 . Is there a flow meeting the demands in Figure 4.4? 

7. Suppose vertices b, c, d in Figure 4.4 have unlimited supplies. How much flow 
can be sent to the set {h, i,j}l Explain your model. 

8. Vertices b, c, d have supplies 30, 20, 10, respectively, and vertices j , k have 
demands of 30 and 25 in this network. 


30 



30 


wlO 


' 25 


(a) Find a flow satisfying the demands, if possible. 

(b) Reverse the direction of edge {h, g) and repeat part (a). 

9. Solve the messenger problem in Example 6. 

10. Suppose that up to three messengers can use each edge in Example 6. Now how 
many messengers can be sent? Is the answer with such a modification always 
just three times the answer to the original problem? 

11. (a) Ignoring the numbers of the edges in Exercise 3 of Section 4.1, what is the 

size of the largest set of edge-disjoint paths from L to W1 

(b) What is the size of the largest set of paths from L to W such that no edge is 
used by more than five paths? 
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12. Suppose that no more than five units of flow can go through each intermediate 
vertex b, c, d, in Figure 4. Ha. Now find a maximum flow in this revised network 
and associated minimum a-z cut. 

13. Suppose that no more than 20 units of flow can go through each intermediate 
vertex b, c, d, e,f, g in Figure 4.4. Now find the maximum flow in this revised 
network. 

14. What is the size of a largest set of vertex-disjoint paths from a to z in Figure 4.10? 

15. Solve the dynamic flow problem in Example 7. 

16. In Example 7, suppose that the trains do not run every day. Let trains depart 
from a and c on Monday, Wednesday, and Friday, and from b and d on Tuesday, 
Thursday, and Saturday. In one week, Monday through Sunday, how many cars 
can be sent from a to z in that network? 

17. Suppose that it takes one day to traverse each edge in Figure 4.5. How many 
units can be moved from a to z in five days in this network? 

18. In the proof of Theorem 3, show that f° = mf K satisfies the second part of 
condition (b). 

19. (a) Prove that if a directed network contains edges (x/ y) and (y r , x) for some 

x, y, then the augmenting flow algorithm would never make assignments that 
would have flow occurring simultaneously in both edges. 

(b) Could edges (x~*, y ) and (y r , x) both get flow if the augmenting flow algorithm 
in Step 2 checked outgoing edges before incoming edges? 

20. (a) Restate the augmenting flow algorithm so that the labeling starts at z and 

“works back” to a. 

(b) Restate the augmenting flow algorithm for undirected networks (see 
Example 5). Sketch a proof of this algorithm. 

21. Show that the set of edges used to label vertices in Steps 2a and 2b of the 
augmenting flow algorithm form a tree rooted at a. 

22. (a) Give a weakened replacement for condition (b) in the definition of an 

a-z flow. The new condition should still ensure that the net flow is from 
a to z. 

(b) Suppose condition (b) is eliminated. Can there exist maximum flows that 
violate condition (b)? Prove or give a counterexample. 

23. Build a flow in the network in Figure 4.8 a with the prescribed properties: 

(a) Its value |/| is 0, but not all edges have 0 flow. 

(b) Its value is 2, but it cannot be decomposed into a sum of a-z flow paths. 

24. (a) Show that for a flow / in a (directed or undirected) network, if / is circuit- 

free — that is, there is no set of edges with flow that form a (directed) circuit — 
then / can be decomposed into a sum of a-z flow paths. {Hint: Prove by 
induction on the value off.) 

(b) Use the proof to get a decomposition algorithm for any such/. 
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(c) Conclude that any flow /can be decomposed into |/| a-z flow paths plus a 
set of circuits. 

25. (a) Prove that starting from a circuit-free flow (see Exercise 24), perhaps a zero 

flow, the maximum flow generated by the augmenting flow algorithm is 
circuit-free and hence [by Exercise 24(b)] can be decomposed into a sum of 
a-z flow paths. 

(b) Use part (a) and Exercise 24(b) to find the routings of a maximum set of 
phone calls in Example 5. 

26. How many times is an edge checked in Step 2 to perform one complete it- 
eration of the augmenting flow algorithm (resulting in increased flow or a 
min-cut)? 

27. Show that if a flow is decomposed into unit-flow paths and if each unit-flow path 
crosses a given saturated cut once, then the flow is maximum. 

28. A cut-set in an undirected graph G is a set S of edges whose removal disconnects 
G such that no proper subset of S disconnects G. Prove that in an undirected flow 
network, every cut-set that separates a and z is an a-z cut and every minimum 
a-z cut is a cut-set. 

29. Let G be a connected, undirected graph and a, b be any two vertices in G. 

(a) Show that there are k edge-disjoint paths between a and b if and only if every 
a-b cut has at least k edges. 

(b) Show that there are k vertex-disjoint paths between a and b if and only if 
every set of vertices disconnecting a from b has at least k vertices. 

30. As mentioned in Example 6, we can model a flow network N (directed or undi- 
rected) by another multigraph flow network N' in which each edge has unit 
capacity; N' has the same vertices as N and for each edge e in N there are k(e) 
edges in N ' paralleling e. Since a flow in N' takes 0 or 1 values on the edges, we 
can drop the capacities in N' to get a multigraph G' . A flow in N’ is just a subset 
of edges (with flow) in G' . 

(a) Characterize the subsets of edges in G' that correspond to an a-z flow in N' . 

(b) Restate the augmenting flow algorithm in terms of G' . 

(c) Using the multigraph model, prove that any a-z flow/in N' contains \ f\ a-z 
flow paths. 

(d) Using the multigraph model and assuming the result in Exercise 29(a), prove 
Corollary 3a (max flow-min cut theorem). 

31. Suppose the numbers on the edges in a directed network represent lower bounds 
for the flow. State a decreasing flow algorithm. Sketch a proof of this algorithm 
and deduce the counterpart of Corollary 3a. 

32. (a) Explain how the algorithm in Exercise 3 1 can be applied to a bipartite graph 

to find a minimum set of edges incident with every vertex in the graph. 

(b) Find such an edge set for the bipartite graph in Figure 4.12 of Section 4.4. 
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33. Suppose that we have upper and lower bounds k t (e) and k^ie), respectively, on 
the flow in each edge e in a directed network, and that we are given a feasible 
flow (satisfying these constraints). 

(a) Modify the augmenting flow algorithm so that it can be used to construct a 
maximum flow from a given feasible flow. 

(b) Prove that the maximum flow has value equal to the minimum of k\ ( P , P) — 
k 2 (P, P), among all a-z cuts (P, P), where A: | (.S'. S) and k 2 (S. S) are the sums 
of the upper and lower bounds of edges in a cut ( S , S). 

34. Consider a directed network with supplies and demands as in Example 1 . Let 
z(P) be the total demand of vertices in set P and a(P) be the total supply of 
vertices in P. 

(a) Prove that the demands can be met if and only if for all sets P, z(P) — 
a(P) < k( P , P). (Hint: Generalize the reasoning in Example 1.) 

(b) Prove that the supplies can all be used if and only if for all P, a( P) — 
z(P) < k(P, P). 

35. Suppose the edges in Figure 4.11a in Example 7 were undirected. How would 
we construct a static flow model to simulate this dynamic flow problem so that 
a maximum static flow in the new network would correspond to a maximum 
dynamic flow? Are there any difficulties? 

36. Suppose an undirected flow network is a planar graph and a (at the left side) 
and z (at the right side) are both on the unbounded region surrounding the 
network. Draw edges extending infinitely to the left from a and to the right 
from z; give them infinite capacity. This divides the unbounded region into two 
unbounded regions, an upper and a lower unbounded region. Now form the dual 
network (see Section 1 .4) of this planar network with each dual edge assigned 
the capacity of the original edge it crosses. 

(a) Show that a path from the upper unbounded region’s vertex to the lower 
unbounded region’s vertex in the dual network corresponds to an a-z cut 
in the original network. Thus, a shortest such path in the dual network is a 
minimum a-z cut in the original network. 

(b) Draw the dual network for the network in Figure 4.9 and find a shortest path 
(using the algorithm in Section 4.1) corresponding to a minimum cut in the 
original network. 

37. Suppose an undirected flow network A is a planar graph and a (at the left side) 
and z (at the right side) are both on the unbounded region surrounding the 
network. Consider the following flow-path building heuristic. Starting from a, 
build an a-z flow path by choosing at each vertex x the first unsaturated edge in 
clockwise order starting from the edge used to enter x. 

(a) Apply this heuristic to the network in Figure 4.9 to find a maximum 
flow. 

(b) Show that repeated use of this heuristic yields a maximum flow in N. 
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38 . Prove that repeated use of the augmenting flow algorithm yields a maximum 
flow in a finite number of applications for networks with irrational capacities 
provided that vertices are ordered (indexed) and that the next vertex scanned in 
Step 2 of the algorithm is the labeled vertex with lowest index. 

39 . Write a program to find maximum flows in directed networks (the networks are 
input data). 

40 . Write a program to find the maximum number of paths in an undirected graph 
between two given vertices such that the following are true: 

(a) The paths are edge disjoint. 

(b) The paths are vertex disjoint. 

41 . Write a program that when given a network with an a-z flow of value k, will 
extract from the flow k unit-flow paths from a to z. 


42 . Write a program to find, for a given pair of vertices a and b in a given connected 
graph, a minimum set of vertices whose removal disconnects a from b (see 
Exercise 29). 



In this section, we apply network flows to the theory of matchings. Recall that a 
bipartite graph G = (X, Y, E) is an undirected graph with two specified vertex sets 
X and Y and with all edges of the form (x, y), x e X, y e Y . See Figure 4.12. Bipartite 
graphs are a natural model for matching problems. We let X and The the two sets to 
be matched and edges ( x , y) represent pairs of elements that may be matched together. 

A matching in a bipartite graph is a set of independent edges (with no common 
endpoints). The thicker edges in Figure 4.12 constitute a matching. An A-matching 
is a matching involving all vertices in X. A maximum matching is a matching of the 
largest possible size. As with network flows, we cannot always obtain a maximum 
matching in a bipartite graph by simply adding more edges to a non-maximum 
matching. The matching indicated in Figure 4.12 cannot be so increased, even though 
it is not maximum. 

A typical matching problem involves pairing off compatible boys and girls at a 
dance or the one-to-one assignment of workers to jobs for which they are trained. 



Figure 4.12 


m 
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A closely related problem is to find a set of distinct representatives for a collection 
of subsets. We need to pick one element from each subset without using any element 
twice. In the bipartite graph model, we make one X-vertex for each subset, one 
7-vertex for each element, and an edge (x, y ) whenever element y is in subset x. Now 
an X-matching picks a distinct representative element for each subset. Conversely, 
any matching problem can be modeled as a set-of-distinct-representatives problem. 

We employ a modification of the trick in Example 1 of Section 4.3 to turn a 
matching problem into a network flow problem. Associate a supply of 1 at each 
X- vertex and a demand of 1 at each 7-vertex. The capacities of the edges from X to 7 
can be any large positive integers, but it is convenient to assume that these capacities 
are oo . We also assume that edges are directed from X to 7. Now we apply the technique 
in Example 1 of Section 4.3 with source a connected by a unit-capacity edge to each 
X- vertex and sink z connected by a unit capacity edge from each 7- vertex. 

We call such a network a matching network. See Figure 4.13. The X-Y edges 
used in an a-z flow constitute a matching. A maximum flow is a maximum matching. 
A flow saturating all edges from source a corresponds to an X-matching. 

Example 1: Maximum Matching 

Suppose the bipartite graph in Figure 4.12 represents possible pairings of boys b, c, 
d, e,/with girls g, h, i,j, k, m. A tentative matching indicated by thicker edges in 
Figure 4.12 was made. Although this matching cannot be extended, we still wonder 
whether a complete X-matching is possible. 

As has happened before in flow problems, we have made a “mistake” in this 
matching and now need to make some reassignments. We convert this matching into 
the corresponding flow in the associated matching network: darkened edges in Figure 
4.13 have a flow of 1, other edges have a flow of 0. Now we apply the augmenting 
flow algorithm. See Figure 4.13. 

From a, the only outgoing edge that is not saturated is {cT, c), since c is the only 
X-vertex not involved in the initial matching. We label c (a + , 1). From c we can label 
the two T-vertices, g and i, adjacent to c with the label (c + , 1). At g the one outgoing 
edge — to z — is saturated. Then there must be an incoming edge to g with positive 
flow, namely (b", g), coming from the unlabeled X-vertex b. We label b (g~ , 1). By 
similar reasoning, from ; we label 1). At b and / we can label T-vertices not 
currently matched to b and/. We label j (b + , 1) and m (/+, 1). 



Figure 4.13 
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Summarizing our labeling thus far, we are trying to find a match for c (the only 
A-vertex currently unmatched). We have possible matches of c with g and i, but to 
make either match, we must sever the current match of g with b or of i with /and 
rematch b or/with other K-vertices — in this network, h can be rematched withy or/ 
rematched with m. Continuing this reasoning, to rematch b with j requires us to delete 
y’s current match with e and find a new match for e. Or to rematch/with m requires 
us to delete m’s current match with d and find a new match for d. Now either d can 
be matched with h (currently unmatched) or e can be matched with k (also currently 
unmatched). The fact that h and k are unmatched is reflected in the fact that from 
either h or k one can label z. See the labels in Figure 4.13. 

If we label z (k + , 1), the augmenting flow chain prescribed by the algorithm is 
a-c-g-b-j-e-k-z. Our augmenting flow chain specifies that we add edge (cd g) to our 
matching, remove edge (bd g), add edge (bd j), remove edge (ed j), and add edge 
(e~*, k). The new flow corresponds to the matching b—j, c-g, d-m, e-k,f-i. Another 
application of the algorithm would label only a. The set of edges leaving a now forms 
a (saturated) minimum a-z cut — a sign that we have an A-matching. ■ 

Observe that the action of the augmenting flow algorithm in matching problems 
can be described as follows: starting from an unmatched vertex xi in A, we go on a 
nonmatching edge (an edge not currently used in the matching) to a matched Y - vertex 
yi, then we move back from >>| along a matching edge to a matched A-vertex .ty, 
then we go on a nonmatching edge from X 2 to a matched vertex yy, and so on until a 
nonmatching edge (xk , }'k ) goes from a matched A- vertex Xk to an unmatched Y- vertex 
yk . In short, starting from an unmatched A- vertex, we create an odd-length alternating 
path L of nonmatching and matching edges in search of an unmatched T-vertex. Given 
such a path, we get a new, larger set of matching edges by interchanging the roles of 
matching and nonmatching edges on L. 

We have seen that in bipartite graphs, a matching is analogous to a flow in a 
network. What then is the bipartite graph counterpart to an a-z cut? Actually it is 
convenient to restrict our attention to finite-capacity a-z cuts. The corresponding 
concept is an edge cover, a set S of vertices such that every edge has a vertex of 
S as an endpoint. Recall that edge covers were encountered in a street surveillance 
problem in Example 4 of Section 1.1. 

Lemma 

Let G — (A, Y, E ) be a bipartite graph and let Abe the matching network associated with 
G. For any subsets (possibly empty) Ac A and B C Y, S = A U B is an edge cover 
if and only if ( P , P)) is a finite capacity a-z cut in N, where P = a U (X — A)L) B . 
In terms of P, S = (P fl A)U (P n T). Further, |S| = k(P, P). 

Proof 

A finite capacity a-z cut cannot contain an edge between A and Y (whose capacity 
is oo). Then a finite-capacity cut ( P , P) must consist of edges of the form (ad x), 
x e A and (y~*, z), y e H for some sets AC A and B Cf. These edges block all 
flow (and thus are an a-z cut) if and only if any A —Y edge starts at some x e A or 
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Figure 4.14 


pnx * Pny 



ends at some y e B — that is, if and only if S = A U /i is an edge cover. In terms 
of A and B, P = a U (X - A) U B. and in terms of P, A — PHX and B = P n Y. 
See Figure 4.14. Also, \S\—k(P,P), since the edges in (P, P) all have unit 
capacity. ♦ 


We now prove two famous theorems about matching in bipartite graphs. 

Theorem 1 (Konig-Egervary) 

In a bipartite graph G = ( X , Y, E), the size of a maximum matching equals the size 
of a minimum edge cover. 


Matchings correspond to flows in the associated matching network and, by the 
lemma, edge covers correspond to (finite capacity) a—z cuts. Theorem 1 is simply 
a bipartite-graph restatement of the max flow-min cut theorem (Corollary 3a in 
Section 4.2). 

In the next theorem, the range R(A) of A denotes the set of vertices adjacent to 
a vertex in A. 


Theorem 2 (Hall’s Marriage Theorem) 

A bipartite graph G — (X, Y, E) has an A"- matching if and only if for each subset 
A C A, |/?(A)| > | A], 

Proof 

The condition is necessary, for if |/?(A)| < |A|, a matching of the vertices in A is 
clearly impossible, and so an A- matching is impossible. 

Next we show that if |/?(A)| < | A|, for all A C X, then G has a A-matching. By 
Theorem 1, an A- matching exists if and only if each edge cover S has size .S' > | A|. 
If S contains only A-vertices — that is, S = X — then the result is immediate. We must 
show that S cannot be made smaller by dropping some A-vertices and in their place 
using a smaller number of T-vertices. If A is the set of A-vertices not in .S, then .S fi Y 
must contain the vertices in R(A) to cover the edges between A and R(A). Then we 
have 


\s\ = |snA| + |snr| = |A| - |A| + |sny| 
> | A | — | A | -+- |/?(A)| 
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Since |/?(A)| > |A|, then |X| — |A| + |/?(A)| > |X|, and so |5| > |X|, proving that 
there is a matching of size X. ♦ 

Theorems 1 and 2 are the starting point for a large family of matching theorems 
(for example, see Exercises 18 and 23). 

Example 2 

Country A would like to spy on all meetings in its territory between diplomats from 
Country X and from Country Y. We know which pairs of diplomats are likely to meet. 
We can make a bipartite graph expressing this likely-to-meet relationship. Country A 
cannot afford to assign spies to every X diplomat or to every Y diplomat. Instead it 
wants to find a minimum set S of diplomats such that every possible meeting would 
involve a diplomat of S. 

Country A hires a graduate of this course, who immediately sees that this min- 
imum spying problem is really a minimum edge cover problem in disguise. The 
problem can thus be solved by using the augmenting flow algorithm to find a min- 
imum a-z cut in the associate matching network and, from the a—z cut, obtain a 
minimum edge cover using the lemma. ■ 

Example 3 

Suppose there are n people and n jobs, each person is qualified for A: jobs, and for each 
job there are k qualified people. Is it possible to assign each person to a (different) 
job he or she can do? 

We model this problem using a bipartite graph G = (X, Y, E ) with A - - vert ices 
for people and F-vertices for jobs. Note that each vertex will have degree k. The 
question is now: is there an X -matching? From Theorem 2, we can deduce a yes 
answer as follows: Let A be any subset of X. Since each vertex has degree k, there 
will be k\ A | edges leaving A. Since at most k of these edges can go to any one vertex 
in Y, it follows that R( A) has at least |A| vertices. Then, by Theorem 2, there is an 
A" -matching. ■ 

We close this section with an application of matching networks to the sports 
problem of determining late in a season whether a particular team still has a math- 
ematical chance of being conference champion. The network model and associated 
analysis presented here are due to Schwartz [4], 

Example 4: Elimination from Contention 

Suppose we know how many games each of the teams in a sports conference has won 
to date and how many games remain to be played between each pair of teams. We 
want to know if there exists a scenario under which a specified team could finish the 
season with the most wins in the conference. 

Let us consider a concrete example with four teams, the Bears, the Lions, the 
Tigers, and the Vampires, with the records of wins and games to play in Figure 4.15. 
Suppose we wonder if it is possible for the Bears (currently with fewest wins) to end 
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Figure 4.15 


Wins Games with with with with 


Team 

to date 

to play 

Bears 

Lions 

Tigers 

Vampires 

Bears (q) 

16 

7 

— 

2 

2 

3 

Lions (f 2 ) 

22 

7 

2 

— 

3 

2 

Tigers (f 3 ) 

20 

8 

2 

3 

— 

3 

Vampires (U) 

19 

8 

3 

2 

3 

— 


the season with the most wins. The problem is that even assuming the Bears win the 
rest of their games to finish with 23 wins, the outcomes of the games between the 
other teams may always result in one of those teams finishing with more wins than 
the Bears. Thus, the challenge is to find an answer to the following question: 

Does there exist a way to assign the wins in the games among other 

teams so that each of those teams finishes with at most 23 wins? 

We describe how to construct a matching network model to answer this question. 
Figure 4.16 gives the network for the data in Figure 4.15. The X- vertices will represent 
the other teams besides the Bears. In Figure 4. 16 the X - vertices will be L (for Lions), 
T (for Tigers), and V (for Vampires). The T-vertices will represent all different pairs 
of other teams. In this case, LT, LV, and TV. 

The size of the flow in the edge from source a to A'-vertex t, will stand for the 
number of games won by the ;-th team t, in the rest of the season. The capacity of 
the edge ( a *, t , ) will be 23 — w,-, where w,- denotes the number of wins to date by tj. 
Observe that if team tj has won w, games to date, and if it wins at most 23 — w; games in 
the rest of the season, then t, cannot finish with more than 23 wins, enabling the Bears 
to be the champion or co-champion (assuming the Bears win all remaining games). 

The flow in the X-Y edge (t", (f, ,tj)) will represent the number of wins of t, over 
tj in their remaining games. We can let the capacity of all X-Y edges be oo. The flow 
in the edge from the Y - vertex (tj, tj) to the sink z represents the number of games 
that will be played between teams t, and tj. The capacity of ((tj, tj )" , z) is the number 
of scheduled remaining games between tj and tj. Conservation of flow at vertex r, 
requires that flow in (the number of games won in the rest of the season) = flow out 
(the number of future wins over various other teams). Conservation of flow at the 
vertex (tj, tj) requires that flow in (the wins of t, over tj and of tj over tj) — flow out 
(the number of games these two teams will play). 


L 0 LT 



Figure 4.16 
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To have a flow that represents a possible real scenario, we require that the edges 
from Y to z all be saturated — the number of games played between t; and tj is the actual 
number of games scheduled between them. If a flow exists in this network that saturates 
the Y-z edges, it produces a scenario in which the Bears end the season with the most 
wins. Figure 4.16 presents a flow saturating the Y-z edges showing how the Bears 
could end up with the most wins (actually, all four teams would tie for first place). ■ 

Optional We now extend the preceding sports network model to give a necessary 
and sufficient condition for a particular team t * to be able to be conference champion. 
For a subset S of teams, let w(A) denote the total number of wins to date by all teams 
in S and let r(S) be the total number of games remaining to be played between all 
pairs of teams in S. Observe that for any subset S of teams, the total number of wins 
at the end of the season by teams in S will be at least w(S) + r(S) (because when 
two teams in S play each other, one of them must win). Let W* be the final number 
of wins by t * if t* wins all remaining games. One constraint that guarantees that t* 
cannot finish with the most wins is if the average number of wins by any subset S not 
containing t* exceeds W* — that is, 

for any subset S not containing t* , {w(S) + r(S)}/|S| > W* (*) 

We shall show that constraint (*) is sufficient as well as necessary to characterize 
when team f is eliminated. If the min-cut is the set of Y-z edges, then there is a 
flow saturating all Y-z edges and specifying a scenario in which t* has the most wins 
(as described in the example above). Suppose there is an a-z cut ( P , P) of smaller 
capacity. We now show that (*) is satisfied by some subset S. 

(P, P) cannot contain any of the infinite-capacity edges from X to Y, and so it 
consists solely of a-X edges and Y-z edges. Recall that edge (cT, t, ) has capacity 
W* - w, . Then if A = P fl X, the a-X edges in (P, P) have capacity | A | W* — w(A). 
One can show that if f, and tj are in P, then (f, , tj) must be in P (details are left as an 
exercise). Then the Y-z edges in (P, P) have capacity where r i; - denotes 

the capacity of edge ((?, , tj f, z) (= the number of games remaining to be played 
between t ,■ and tj) and we sum over all (f;, tj) pairs for which at least one of f,- or tj 
is not in A. So k(P, P) = [\A\W* - w(A)] + ‘.Ary- 

Assuming k(P, P) is less than L r, ; - , the capacity of all the edges from Y to z, 

then 


| A | W* - w(A) + E 

(ti,tj)<£A 

=> \A\W* - w(A)<J2 r u- E r ‘j= E r U = r ( A ) 

(t t ,tj)<£A (t,,tj)CA 

=> |A|W* < w(A) + r(A)=> W* < {w(A) + r(A)}/| A| 


This last inequality is (*), as claimed, with the subset being A. The teams in A average 
more wins than f, and hence t* cannot be conference champion. 
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4.4 EXERCISES 


' Summary of Exercises Exercises 1-14 involve matching net- 
works. Exercises 15-24 develop theory. 

1. Bill is liked by Ann, Diana, and Lolita; Fred is liked by Bobbie, Carol, and Lolita; 
George is liked by Ann, Bobbie, and Lolita; John is liked by Carol and Lolita; 
and Larry is liked by Diana and Lolita. We want to pair each girl with a boy she 
likes. (Make girls the vertices on the left side.) 

(a) Set up the associated matching network and maximize its flow to solve this 
problem. 

(b) Make a flow corresponding to a partial pairing that has Bill with Diana and 
Fred with Carol along with two other matches (chosen by the reader). Now 
apply the flow augmenting algorithm to increase the matching to a complete 
matching. 

2. Suppose there are five committees: committee A’s members are a, c, e; committee 
B’s members are b, c; committee C’s members are a, b, d\ committee D’s members 
are d, e, /; and committee E’s members are e, f. We wish to have each committee 
send a different representative to a convention. 

(a) Set up the associated matching network and maximize its flow to solve this 
problem. 

(b) Make a flow corresponding to the partial assignment: A sends e, B sends b, C 
sends a , and D sends/. Now apply the flow-augmenting algorithm to increase 
the matching. 

3. Let us repeat Exercise 1, but this time Bill gets a total of five dates, Fred four 
dates, George three, John five, and Larry three, while Ann gets four, Bobbie three, 
Diana five, Carol four, and Lolita four. Compatible pairs may have any number 
of dates together. Model with a network to find a possible set of pairings. 

4. Let us repeat Exercise 1, but this time we want to pair each boy twice (with 
two different girls) and each girl twice. Find the pairings using an appropriate 
network flow model. 

5. Suppose that there are six universities and each will produce five mathematics 
Ph.D.s this year, and there are five colleges that will be hiring seven, seven, six, 
six, five math Ph.D.s, respectively. No college will hire more than one Ph.D. from 
any given university. Will all the Ph.D.s get a job? Explain. 

6. In Example 4, is it possible for the Vampires to be champions (or co-champions) 
if they win all remaining games? Build the appropriate network model. 

7. (a) In the following table of remaining games, is it possible for the Bears to be 

co-champions, or sole champions, if they win all remaining games? Build the 
appropriate network model and show the required flow, if possible. Answer 
the question with a feasible flow in the network you created or with an 
explanation of why one is not possible. If there are co-champions with the 
Bears, who are they? 
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(b) Is it possible for the Bear to be sole champions? 


Wins Games with with with with 


Team 

to date 

to play 

Bears 

Lions 

Tigers 

Vampires 

Bears 

21 

7 



1 

2 

4 

Lions 

26 

6 

1 

— 

3 

2 

Tigers 

26 

7 

2 

3 

— 

2 

Vampires 

22 

8 

4 

2 

2 

— 


8. In the following table of remaining games, is it possible for the Vikings to be 
co-champions, or sole champions, if they win all remaining games? Build the 
appropriate network model and flow. Answer the question with a feasible flow 
in the network you created or with an explanation of why one is not possible. If 
there are co-champions with the Bears, who are they? 


Wins Games with with with with 


Team 

to date 

to play 

Vikings 

Huns 

Romans 

Mongols 

Vikings 

22 

6 

— 

2 

2 

2 

Huns 

27 

6 

2 

— 

2 

2 

Romans 

26 

6 

2 

2 

— 

2 

Mongols 

25 

6 

2 

2 

2 

— 


9. In the following table of remaining games, is it possible for the Bears to be 
co-champions, or sole champions, if they win all remaining games? Build 
the appropriate network model. Answer the question with a feasible flow in 
the network you created or with an explanation of why one is not possible. 
If there are co-champions with the Bears, who are they? 


Wins Games with with with with 


Team 

to date 

to play 

Bears 

Lions 

Tigers 

Vampires 

Bears 

18 

8 

— 

1 

3 

4 

Lions 

25 

6 

1 

— 

2 

3 

Tigers 

24 

7 

3 

2 

— 

2 

Vampires 

22 

9 

4 

3 

2 

— 


10. In the following table of remaining games, is it possible for the Bears to be 
co-champions, or sole champions, if they win all remaining games? Build 
the appropriate network model. Answer the question with a feasible flow in 
the network you created or with an explanation of why one is not possible. 
If there are co-champions with the Bears, who are they? 
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Wins Games with with with with 


Team 

to date 

to play 

Bears 

Lions 

Tigers 

Vampires 

Bears 

20 

6 

— 

1 

2 

3 

Lions 

25 

6 

1 

— 

3 

2 

Tigers 

25 

6 

2 

3 

— 

1 

Vampires 

22 

6 

3 

2 

1 

— 


11 . There are n boys and n girls in a computer dating service. The computer has 
made nm pairings so that each boy dates m different girls and each girl dates m 
different boys (m < n). 

(a) Show that it is always possible to schedule the nm dates over m nights — that 
is — that the pairings may be partitioned into m sets of complete pairings. 

(b) Show that in part (a), no matter how the first k complete pairings are selected 
(0 <k< m), the partition can always be completed. 

12 . We want to construct an n x m matrix whose entries will be nonnegative integers 
such that the sum of the entries in row i is r, , and the sum of the entries in column 
j is Cj. Clearly the sum of the r, s must equal the sum of the t/S. 

(a) What other constraints (if any) should be imposed on the r, s and Cj s to ensure 
such a matrix exists? 

(b) Construct such a 5 x 6 matrix with row sums 20, 40, 10, 13, 25 and column 
sums all equal to 18. 

13 . We have a group of people and each is a member of a subset of committees. In 
addition, each person graduated from (exactly) one of three different universities. 
In this extension of the “distinct representatives” problem, we seek a unique 
person to represent each committee with the added constraint that a third of the 
representatives must have graduated from each of the three universities. Describe 
how build a network to model this problem. (Assume that m, the number of 
committees, is a multiple of 3.) 

14 . (Due to Bacharach) We are given an n by n matrix with numerical entries that 
all have one digit to the right of the decimal point (e.g., 13.3). We want to round 
the entries to whole numbers in a fashion so that the sum of the rounded entries 
in each column (and row) is a rounded value of the original column (row) sum; 
e.g., if the first column has entries 2.5, 6.4, 5.7 summing to 16.6, then the sum 
of the rounded values of these three entries would have to be 16 or 17. 

(a) Describe how to build a matching-type network flow model of this problem. 
Let X-vertices represent the columns and K-vertices represent the rows. Note 
that this model will have lower as well as upper bounds for each edge. 

(b) Find the required rounding, if possible, using the network model developed 
in part (a) for the following matrix: 


4.5 

7.5 

2.5 

6.8 

4.3 

5.7 

3.6 

1.6 

4.3 
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15. Give necessary and sufficient conditions for the existence of a circuit or a set 
of vertex-disjoint circuits that pass through each vertex once in a directed graph 
G = (V, E). [Hint: Make a bipartite graph G' = ( X , Y E) with X and Y copies of 
V, and for each edge (vf, V 2 ) in G, G' has an edge (x\ , >> 2 ); restate the problem.] 

16. Prove that a bipartite graph G = (X, Y, E) has a matching of size t if and only 
if for all A in X, |/?(A)| > | A\ + 1 — |A| = t — \X — A\. {Hint: Add \X\ — t new 
vertices to Y and join each new T-vertex to each A- vertex.) 

17. Show that every bipartite-graph matching problem can be modeled as a set-of- 
distinct-representatives problem. 

18. In the analysis following Example 4, show that if f; and tj are in P, then vertex 
(f;, tj) is in P. 

19. Prove Theorem 2 using Exercise 34 in Section 4.3, which applies directly to the 
bipartite network, not the augmented a-z matching network. 

20. Prove Theorem 2 for complete T-matchings (without simply interchanging the 
roles of X and Y). By symmetry, the same condition is required, but the set A in 
the reproof is chosen differently from the A in the text’s proof. 

21. Let 8(G) — maxAcxd A| — |/?(A)|. 5(G) is called the deficiency of the bipartite 
graph G = (X, Y, E) and gives the worst violation of the condition in Theorem 2. 
Note that 8(G) > 0 because A = 0 is considered a subset of X. 

(a) Use Exercise 16 to prove that a maximum matching of G has size | X\ — 8(G). 

(b) Given a maximum matching of size t = \X\ —8(G) [assume 8(G) > 0], 
describe how the associated minimum edge covering of Theorem 1 can be 
used to find an A such that |A| — |/?(A)| = 8(G). 

22. (a) Show that the size of the largest independent set of vertices (mutually non- 

adjacent vertices) in G = (X, Y, E) is equal to |F| + 8(G) (see Exercise 21). 
Describe how to find such an independent set. 

(b) Use part (a) to find such an independent set in Figure 1.3. 

23. (Due to J. Hopcroft) Suppose each vertex of a bipartite graph G has degree 2 r , 
for some r. Partition the edges of G into circuits, and delete every other edge in 
each circuit. Repeat this process on the new graph (where each vertex now has 
degree 2 r ~ 1 ), and continue repeating until a graph is obtained with each vertex 
of degree 1. The edges in this final graph constitute a matching of the vertices 
of G. Show that such a partition of edges into circuits exists in each successive 
graph and can be found in a number of steps proportional to the number of edges 
in the current graph. 

24. Suppose we are given a partial matching to an arbitrary graph (such a matching 
is a set of edges with no common endpoints). 

(a) Prove that a generalization of the interchange method along a path alternating 
between nonmatching and matching edges (described following Example 1 ) 
can be used to increase the size of the partial matching until it is a maximum 
matching. 
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(b) Randomly pick a partial matching for the graph in Figure 4.9 and use this 
method to get a maximum matching. 


25. Write a computer program to find a maximum matching and a minimum edge 
cover in a bipartite graph (the graph is input data). 



4.5 THE TRANSPORTATION PROBLEM 


In this section we apply our knowledge of spanning trees, network flows, and match- 
ings developed in the previous sections to study network flows in which there is a cost 
charged to flow along an edge. We will consider networks whose underlying graphs 
are bipartite, as in the matching networks in Section 4.4. Similar to the matching 
problems, the edges here have unlimited capacity and the vertices have supplies and 
demands — now integers larger than 1 . We refer to the supply vertices as warehouses 
and the demand vertices as stores. The bipartite graph G = ( X , Y , E) is assumed to 
be complete, with an edge (;, j) from each warehouse W, to each store Sj. What 
is new is that there is a cost Cy charged for shipping an item on edge (i, j). The 
goal is to find a routing of all the items from warehouses to stores that minimizes 
the transportation costs. See the sample problem in Figure 4.17. This optimization 
problem is appropriately called the Transportation Problem. It was one of the first 
optimization problems studied in operations research. 

Warehouse i has a supply of size s , , and store j has a demand of size dj . We 
assume that the total supplies, summed over all warehouses, are adequate to meet the 
demand at all the stores. A solution to a transportation problem specifies the amount 
Xij to ship on each edge (i, j) so that the total shipments leaving each warehouse do 
not exceed the warehouse’s supply and the total shipments arriving at each store equal 
the demand of that store. The goal is to find, among all solutions, a minimum-cost 
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Figure 4.17 
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solution. A mathematical statement of the transportation problem is 

minimize c ij x ij suc h that 

hj 

Y Xij < s, for each ;, Xjj — dj for each j, and x ;/ > 0 

j i 

While spanning trees do not appear anywhere in the statement of the problem, it 
turns out that they are central to solving this problem: The set of edges used for 
shipments in the optimal solution form a spanning tree. Our strategy to find an optimal 
solution will start with an initial (nonoptimal) solution that is a spanning tree. We 
will repeatedly find a cheaper spanning tree solution by dropping one edge from the 
current spanning tree and replacing it with another edge to form a cheaper spanning 
tree solution. 

The data describing a Transportation Problem are usually presented in a table 
called a transportation tableau. See the tableau and associated bipartite graph for a 
sample transportation problem in Figure 4.17. (The graph representation is so clut- 
tered that it is only drawn for small problems.) Our analysis of how to solve the 
transportation problem will be developed in terms of this example. The supplies 
Si of the warehouses appear on the right side of the tableau, and the demands dj 
of the stores appear at the bottom. The shipping cost for edge (i, j) appears 
in the upper right half of entry (j, j) in the tableau. The number x ;/ - in the lower 
left half of entry (i, j) tells how much is shipped from warehouse i to store j . 
[Note: we refer interchangeably to ( i , j) as an edge or an entry in the tableau.] 
The Xij’s in Figure 4.17 are a (nonoptimal) solution for this transportation prob- 
lem. In the graph in Figure 4.17, we have thickened the edges used in the solution. 
The reader can check that the sum of the x,/s in each row i is < .y, and that the 
sum of the x,/s in each column j exactly equals dj. The cost of the sample solu- 
tion in Figure 4.17’s tableau, which uses the four edges (1,1), (2,1), (2,2), (3,2) is 
30 x $4 + 30 x $6 + 30 x $6 + 20 x $6 = $600. 

To simplify the problem, we will assume that the sum of the supplies exactly 
equals the sum of the demands. If the supplies exceed the demands, we can create a 
“dummy” store that takes in all excess supply. In Figure 4.17, the total of the sup- 
plies is 130 and the total of the demands is 110. In this case, we would create a 
dummy store with a demand of 130 — 110 = 20, as shown in the tableau and graph 
in Figure 4.18. The solution in Figure 4.17 has been appropriately modified in Figure 
4.18 so that the sum of the x,/s in row i now exactly equals .y,- . The cost of ship- 
ping a unit along any edge to this dummy store will be 0, since no real costs are 
involved in this gimmick to balance supply and demand. With 0 costs for shipping to 
the dummy store, the total transportation cost does not change from Figure 4.17 to 
Figure 4.18. 

We will now show that it suffices to limit ourselves to solutions involving a set 
of edges that does not contain any circuits. Recall that a set of edges that contains no 
circuit forms a tree or collection of trees. 
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Lemma 1 

Let S be a solution to a transportation problem involving a set of edges E(S) that 
contains a circuit. Then there is another solution S* that costs the same or less than 
S, where E(S*) is a subset of E(S) containing no circuits. 

Proof 

We will not give a formal proof of this Lemma, but rather illustrate it with an exam- 
ple and explain how this example generalizes to all solutions for all transportation 
problems. 

Consider the solution S displayed in Figure 4.18. Observe that the edges 
used for shipments in S, (1,1), (2,1), (2,2), (3,2), (3,3), (1,3), form a circuit; 
see the darkened edges in the graph in Figure 4.18. We can modify S by in- 
creasing the shipments by 1 on the first, third, and fifth edges of this circuit, 
what we call the odd edges of this circuit starting at (1,1), and reducing the shipments 
by 1 on the even edges (the other edges) of this circuit. These balancing changes 
keep the row and column sums of the modified a' (/ ’s the same, and so the modified 
shipments are again a valid solution. 

Let us check how the original cost of the solution changes with this modification 
of S. One more unit along (1,1), (2,2), (3,3) increases the cost by 4 + 6 + 0 (the costs 
for those three edges) = 10. One less unit along (2,1), (3,2), and (1,3) reduces the cost 
by 6 + 6 + 0= 12. The net change is 10 — 12 = —2. Thus, this modification reduces 
the cost of the solution by 2. To get the largest reduction in cost possible by this 
modification, let us increase the shipments in the odd edges of this circuit as much as 
possible, with corresponding decreases on the even edges of the circuit. The critical 
constraint in our modification is that we cannot decrease the shipment in any even 
edge of the circuit below 0. So we examine the shipment levels in the even edges in 
S. The smallest number is 10 on edge (1,3). So we can increase shipment levels by 
10 on the odd edges and reduce them by 10 on the even edges. Note that by driving 
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the level of shipments down to 0 on edge (1,3), we have dropped this edge from the 
subset of edges used in the new solution. Thus, the new solution is circuit-free. The 
new tableau is given in Figure 4.19. The reader should check that the new solution 
in Figure 4.19 has a cost of $580, which, as expected, is $20 less than the cost of the 
solution in Figure 4.18. 

If it had turned out that our modification of increasing shipments by 1 on odd 
edges and decreasing shipments by 1 on even edges had increased the cost, then 
we would reverse the strategy and decrease shipments on odd edges as much as 
possible — until one of the odd edges has a shipment of zero — and increase shipments 
on even edges correspondingly. Also, if our modification resulted in no change in the 
cost, we would still follow the original strategy of increasing odd-edge shipments and 
decreasing even-edge shipments as much as possible in order to get a new solution 
of the same cost that was circuit-free. 

Because the graph for a transportation problem is bipartite, every circuit in the 
graph will be even-length, by Theorem 2 in Section 1.3. So any circuit’s edges can be 
divided into odd and even edges, as done in the preceding example. Then, whenever 
a solution contains a circuit, we can apply the modification of increasing shipments 
in odd edges and decreasing shipments in even edges as much as possible, or the 
reverse strategy. This will produce a new solution that is less costly, or possibly the 
same cost, and which does not contain that circuit. If the original solution contained 
several circuits, the modification would have to be applied repeatedly until all circuits 
were broken. ♦ 

From Lemma 1 , it follows that we only need to consider solutions which contain 
no circuits — that is, solutions which are spanning trees or spanning forests; a spanning 
forest is a disconnected collection of trees incident to all vertices. For the moment, let 
us assume that all circuit-free solutions are spanning trees. In the following discussion, 
we shall explain how to add one or more edges to a spanning forest solution, if one 
arises, to convert it into a spanning tree solution. 
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Our solution strategy will have two phases. First, find some initial spanning tree 
solution by an ad hoc method. Second, repeatedly modify the current spanning tree 
solution to find a cheaper solution, until a cheapest solution is found. The simplest 
way to find such an initial solution is by an ad hoc method called the Northwest 
Corner Rule. In Exercise 8, we present another way of obtaining an initial solution 
that typically is closer to the optimal solution. 

Phase I: Find an Initial Solution by the Northwest Comer Rule 

We will use our sample transportation problem. It turns out that the solution in Figure 

4. 19 is the one obtained by the Northwest Corner Rule. This rule is applied as follows. 

Start at the northwest corner of the tableau — that is, at entry ( 1,1 ). Make xn 
as large as possible. The value will be the minimum of jt and d\ . For our problem, 
xn = min (40, 60) = 40. We have totally used up the supply at warehouse 1, and so 
we delete row one from the tableau and reduce the demand at store 1 to 60 — 40 = 20. 
The resulting modified tableau is shown in Figure 4.20. 

To meet the rest of the demand at store 1 , we turn to the northwest corner in the 
remaining tableau shown in Figure 4.20 — that is, entry (2,7), and make this entry as 
large as possible. So X 2 \ — min (40, 20) = 20. Now we have satisfied the demand 
at store 1, and so we delete column one from the tableau and reduce the supply at 
warehouse 2 to 40. See Figure 4.21. 
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Figure 4.22 
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We continue the procedure of repeatedly making the entry in the northwest corner 
of the remaining tableau as large as possible. The complete Northwest Corner Rule 
solution is shown in Figure 4.19, which for convenience is displayed in Figure 4.22. 

Note that if the value we assign the current northwest entry in the current tableau 
uses up the supplies at the first (remaining) warehouse and also satisfies the demand 
of the first remaining store, then we would have to delete both the first row and the 
first column of the current tableau. This will lead to a disconnected set of edges in the 
solution — that is, a spanning forest. To avoid this outcome, we arbitrarily keep either 
the first row or first column, although its supplies or demand is 0. 

Next we explain the second phase of how to improve the current solution. There 
are three steps to determining the improvement. 


Phase II: Finding a Better Solution 

Step II.A. Determining Selling Prices at Warehouses and Stores 

At this point, the initial solution is the Northwest Corner Rule solution in Figure 
4.22. We now introduce “selling prices” for the commodity at the warehouses and 
stores, based on the transportation costs in the initial solution. To start the pricing 
process, we need to pick an arbitrary price u\ for the commodity at warehouse 1, say 
ii | = $10. We use edge (1,1) in our initial solution to transport the commodity from 
warehouse 1 to store 1 at a cost of cn = $4 per unit. Thus, if we buy the commodity 
at warehouse 1 for $10 and ship it for $4 to store 1, the commodity should sell for 
Vi = mi + cn = $10 + $4 = $14 at store 1. 

Store 1 also receives shipments from warehouse 2 with a transportation cost 
of c 2 i = $6 per unit. Now we reverse the reasoning used to determine the price at 
warehouse 1 from the price at store 1. Given that the commodity’s price at store 1 
is $14 and it costs $6 to ship from warehouse 2 to store 1, the price at warehouse 2 
should be «2 = ih — Qi = $14 — $6 = $8. Warehouse 2 also ships to store 2 at a cost 
of C 22 = $6 so the price at store 2 should be u 2 = “2 + C 22 = $8 + $6 = $14. Using 
the shipping costs on the edges in our current spanning tree solution, we can continue 


170 Chapter 4 Network Algorithms 


this process to calculate selling prices u 3 , Vt for the commodity at warehouse 3 and 
at store 3, respectively. 

The complete set of selling prices is 

warehouse 1: u 1 = $10 store 1: Ui = $14 

warehouse 2 : m 2 = $8 store 2 : u 2 = $14 ( 1 ) 

warehouse 3: M 3 = $8 dummy: V 3 = $8 

Summarizing, we set u 1 equal to any arbitrary value. Then we determine successive 
Ui ’s and Vj ’s, as in the example, using the conditions that Vj — u -, = ctj , for each edge 
(i, j ) in the current solution. Note that the calculation of prices in this fashion is only 
possible if the edges used in the solution do not form a circuit; for details of this claim, 
see Exercise 9. 

Lemma 2 

Let S be a spanning tree solution of a transportation problem using edge set E(S), and 
let m, and Vj be prices at warehouse i and store j, respectively, based on solution S 
according to Step II. A. The transportation cost of this solution, JT ■ Cjjxi j, summed 
over edges (i, j) in E(S), equals the income from the sale of the stores’ demands 
at the stores’ prices minus the cost of the warehouses’ supplies at the warehouses’ 
prices. That is, 

CijXij = v i d ’ - H UiSi {2) 

i,j j i 


Proof 

For each edge ( i , j) in E(S), the prices are determined by the condition that c, ; - = 
Vj — n, . Then 

C; j Xi j — (vj Ui )xij (3) 

When one sums (3) over all edges in E(S), the total of the shipments x,- ; out of 
warehouse i is s, and the total of the shipments into store j is . Thus, the right-hand 
side of the sum of the (3)s over all edges in E(S) is equal to the right-hand side of 
(2). The left-hand side of this sum of (3)s is clearly the left-hand side of (2). ♦ 

To illustrate Lemma 2, for the solution in Figure 4.22 with the associated prices 
in (1), the right side of (2) is 

($14 x 60 + $ 14 x 50 + $8 x 20) - ($10 x 40 + $8 x 60 + $8 x 30) 

= $1700 — $1120 = $580 

As noted earlier, $580 is the sum of the transportation costs for this solution. 

Step 11. B. Determming Which Edge to Add to the Current Solution 
We now look at the edges that are not used in the current solution. Consider edge 
(7,2) with cost ci 2 = $2. Warehouse l’s selling price is $10 and Store 2’s selling price 
is $14. However, if we buy the commodity at warehouse 1 for $10 and ship it on edge 
(7,2) for $2, we can sell it at store 2 for $10 + $2 = $12, reducing the selling price at 
store 2 by $2 per item. A reduced price at store 2 will cause a reduced profit — store 
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sales minus warehouse purchases — which by Lemma 2 equals a reduced transporta- 
tion cost. This means that a cheaper solution can be obtained by incorporating edge 
(7,2) into the solution. [To maintain a spanning tree solution when edge (7,2) is 
added, some edge in the current solution would have to be dropped — a choice made in 
Step II. C.] Before using edge (7,2), we check the other edges not in the current 
solution to see how much each of them could reduce the profit. The results are 
summarized below. 


edge (7, 2): C12 = $2 < i>2 — wi = $14 — $10 = $4 decrease of $2 

edge (7, 3)\ C13 = $0 > i>3 — ii\ = $8 — $10 = — $2 increase of $2 

edge (2, 3): czs = $0 = Vt, — ui = $8 — $8 = $0 no change 

edge ( 3, 7): C31 = $7 > t>\ — 1/3 = $14 — $8 = $6 increase of $1 


(4) 


We see that only edge (7,2) yields a reduction. So we add edge (7,2) to the current 
solution. If there were no edge that reduces the cost of the solution, then the current 
solution is optimal and we are finished. 

Step II. C. Determining a Cheaper Spanning Tree Solution 

In Step II. B, an edge is chosen to be added to the current solution. For the example in 
Figure 4.22, the choice is edge (7,2). When this edge is added to the set of edges in 
the current solution, we have a unique circuit, by Exercise 28 in Section 3.2. For 
the solution in Figure 4.22, the circuit is (7,2), (2,2), (2,7), (7,7). As noted in the 
proof of Lemma 1 above, we get a cheaper solution by increasing the flow in the 
odd edges, (7,2) and (2,7), as much as possible while decreasing the flow in the even 
edges, (2,2) and (7,7), correspondingly. The net reduction is C12 + C21 — cn — C22 = 
2+6 — 4— 6 = —2. Note that this calculation confirms our previous analysis that we 
will save $2 for each unit shipped on edge (7,2). 

The limiting constraint is when the shipment in an even edge decreases to 0. In 
our example, since the current shipment in both x n and X22 is 40, we can increase the 
shipments in (2,7) and (7,2) by 40 and reduce the shipments in (1,1) and (2,2) by 40. 
The new solution, cheaper by 40 x $2 = $80 than the previous solution, is shown in 
Figure 4.23. 
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Note that although the flow in edges (1,1) and (2,2) both decreased to 0, we cannot drop 
both edges. If we did, the new solution would not be a spanning tree. We arbitrarily 
pick one of (1,1) and ( 2 , 2 ) to stay in the new solution but with a shipment level of 0 . 
In Figure 4.23, we picked (2,2). 

Now we repeat the three parts of Phase II to find a still better solution. 
In Step II. A, using the solution in Figure 4.23, we compute new prices at warehouses 
and stores. Starting with u\ = $10 and using edge (1,2), we find i >2 = u\ +<h 2 = 
$10 + $2 = $12. Continuing as before, we obtain the following set of prices: 

warehouse 1: u\ = $10 store 1: V\ — $12 

warehouse 2: W 2 = $6 store 2: Vi — $12 (5) 

warehouse 3: u 3 = $6 dummy: V3 = $6 

In Step II. B, we see if any edges not in the current solution produce a reduction in 
the transportation costs. The calculations yield 

edge (1,1): cn = $4 > iq — u\ = $12 — $10 = $2 increase of $2 

edge (1,3): C 13 = $0 > V 3 — u\ = $6 — $10 = — $4 increase of $4 

edge (2,3): C 23 = $0 = V 3 — M 2 = $6 — $6 = $0 no change 

edge (3,1): C 31 = $7 > tq — M 3 = $12 — $6 = $6 increase of $1 

Since none of these edges reduces the cost of the current solution, the current 
solution is optimal, and we are finished. 



EXERCISES 


Exercises 1-7 ask for the solution of transportation problems. Exercise 8 
presents a better starting rule. Exercise 9 examines why prices cannot be uniquely 
determined if the solution contains a circuit. 


1. Solve the following transportation problems in which warehouse 1 has 30 units 
and warehouse 2 has 30 units and in which store 1 needs 20 units and store 2 
needs 40 units. The tables below give the transportation costs. 


(a) Store 

1 

2 

(b) Store 

1 

2 

Warehouse 1 

$6 

$2 

Warehouse 1 

$4 

$3 

2 

$4 

$3 

2 

$6 

$4 


2. Solve the following transportation problems in which warehouse 1 has 50 units 
and warehouse 2 has 20 units and in which store 1 needs 20 units and store 2 
needs 30 units. The tables below give the transportation costs. Note that a dummy 
store will be needed for the excess supplies. 

(a) Store 1 2 (b) Store 1 2 

Warehouse 1 $8 $4 Warehouse 1 $4 $4 

2 $3 $6 2 $5 $7 
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3 . Solve the following transportation problems in which warehouse 1 has 30 units, 
warehouse 2 has 30 units, and warehouse 3 has 30 units and in which store 1 needs 
40 units and store 3 needs 50 units. The tables below give the transportation costs. 


(a) Store 


1 

2 

(b) Store 


1 

2 

Warehouse 

1 

$4 

$6 

Warehouse 

1 

$5 

$4 


2 

$5 

$3 


2 

$6 

$5 


3 

$6 

$8 


3 

$3 

$7 


4 . Solve the following transportation problems in which warehouse 1 has 30 units, 
warehouse 2 has 30 units, and warehouse 3 has 30 units and in which store 1 
needs 40 units and store 2 needs 40 units. The tables below give the transportation 
costs. Note that a dummy store will be needed for the excess supplies. 


(a) Store 


1 

2 

(b) Store 


1 

2 

Warehouse 

1 

$5 

$2 

Warehouse 

1 

$8 

$3 


2 

$9 

$5 


2 

$4 

$5 


3 

$4 

$8 


3 

$4 

$9 


5 . Solve the following transportation problems in which warehouse 1 has 30 units, 
warehouse 2 has 30 units and warehouse 3 has 30 units and in which store 1 
needs 20 units, store 2 needs 20 units, and store 3 needs 50 units. The tables 
below give the transportation costs. 


(a) Store 


1 

2 

3 

(b) Store 


1 

2 

3 

Warehouse 

1 

$4 

$6 

$5 

Warehouse 

1 

$8 

$2 

$4 


2 

$2 

S4 

$6 


2 

$4 

$6 

$4 


3 

$5 

$3 

$7 


3 

$5 

$4 

$5 


6. Solve the following transportation problems in which warehouse 1 has 40 units, 
warehouse 2 has 30 units, and warehouse 3 has 50 units and in which store 1 
needs 50 units, store 2 needs 10 units, and store 3 needs 40 units. The tables 
below give the transportation costs. A dummy store will be needed. 


(a) Store 


1 

2 

3 

(b) Store 


1 

2 

3 

Warehouse 

1 

$7 

$2 

$5 

Warehouse 

1 

$8 

$3 

$2 


2 

S3 

$5 

$4 


2 

$3 

$5 

$7 


3 

$4 

$6 

$3 


3 

$6 

$4 

$5 


7 . Solve the following transportation problems in which warehouse 1 has 30 units, 
warehouse 2 has 30 units, warehouse 3 has 30 units, and warehouse 4 has 30 
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units, and in which store 1 needs 40 units, store 2 needs 30 units, and store 3 
needs 50 units. The tables below give the transportation costs. 


( a ) Store 


1 

2 

3 

( b ) Store 


1 

2 

3 

Warehouse 

1 

$4 

$3 

$6 

Warehouse 

1 

$7 

$5 

$3 


2 

$8 

$7 

$4 


2 

$2 

$4 

$6 


3 

$6 

$3 

$7 


3 

$5 

$6 

$5 


4 

$7 

$3 

$8 


4 

$7 

$3 

$3 


8. This exercise presents a better initial spanning tree solution, called the Minimum- 
Cost Rule. Instead of picking the northwest corner entry at each stage, this 
method picks the minimum-cost entry in the current tableau. For the tableau 
in Figure 4.18, the minimum cost entries are (1,3), (2,3), and (3,3), all of 
cost $0. In the case of a tie, we can pick any entry. Suppose we pick (1,3). 
The Minimum-Cost Rule ships as much as possible along edge (1,3) — that is, 
jc 13 = min (40, 20) = 20. Now we delete the column for the dummy store, since 
its demand has been met, and reduce the supplies at warehouse 1 to 60 — 20 = 40. 
The minimum cost entry in the reduced tableau is (1,2) with a cost of $2. So we 
ship as much as possible along edge (1,2). Then xn = min(40, 50) = 40. Now 
we delete the row for the warehouse 1, since its supplies have been used, and 
reduce the demand at Store 2 to 50 — 40 = 10. We continue in this fashion. 

Solve the following transportation problems using the Minimum-Cost Rule: 

( a ) Exercise 4(a) 

( b ) Exercise 5(a) 

( c ) Exercise 6(a) 

9. Apply Step II. A to the tableau in Figure 4.18 to try to determine selling prices 
at warehouses and stores. Since the solution in this tableau uses edges forming 
a circuit, show that there are two routes which can be used to determine the 
selling price at store 3. Are the selling prices at store 3 determined from the two 
different routes the same? 



4.6 SUMMARY AND REFERENCES 


In this chapter we presented algorithms for three basic network optimization problems : 
shortest path, minimum spanning trees, maximum flow, and the transportation prob- 
lem. Principal emphasis was placed on a thorough discussion of maximum flows. We 
showed how these flows could be applied to a wide variety of other network problems. 
In Section 4.4 we used flow models to develop a combinatorial theory of matching. 
All the material about flows in this chapter is discussed in greater detail in the pioneer- 
ing work Flows in Networks by Ford and Fulkerson [2] and in Network Flow Theory 
by Ahuja et al. [1]. We have omitted discussion of the speed of these algorithms. 
The interested reader is referred to Ahuja et al. [1] for efficient implementations of 
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the shortest path and maximum flow algorithms. Modern maximum flow algorithms 
require less than Off) operations for an n- vertex network. 

It is often natural in network flow problems to have costs associated with edges 
so that when many possible maximum flows exist, one can ask for a least-cost maxi- 
mum flow. Such problems are called trans-shipment and transportation problems. 
Similarly, in a matching problem with many solutions ( A- matchings), one can ask for 
a least-cost matching. Such a problem is called an assignment problem. Efficient al- 
gorithms exist for all these minimization problems (see [1], [2], or [3]). Furthermore, 
any flow optimization problem, with or without the abovementioned minimization, is 
a problem of optimizing a linear function of the edge flows subject to linear equalities 
and inequalities, such as the flow constraints (a), (b), and (c) of Section 4.3. Such a 
constrained linear optimization problem is called a linear program. Linear program- 
ming is a principal tool of operations research, and good algorithms exist for solving 
linear programs. However, it is much more efficient to solve network problems with 
the network-specific algorithms presented in this chapter. 

1. R. Ahuja, T. Magnanti, and J. Orlin, Network Flow Theory, Algorithms and 
Applications, Prentice-Hall, Englewood Cliffs, NJ, 1993. 

2. L. Ford and D. Fulkerson, Flows in Networks, Princeton University Press, 
Princeton, NJ, 1962. 

3. F. Hillier and G. Lieberman, Introduction to Operations Research, Holden Day, 
San Francisco, 1988. 

4. B. Schwartz, “Possible winners in partially completed tournaments,” SIAM 
Review 8 (1966), 302-308. 
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CHAPTER 5 

GENERAL COUNTING 
METHODS FOR 
ARRANGEMENTS 
AND SELECTIONS 



TWO BASIC COUNTING PRINCIPLES 


This is the most important chapter in this book. It develops the fundamental skills of 
combinatorial reasoning. We present a few basic formulas, involving permutations 
and combinations, most of which the reader has seen before. Then we examine a 
variety of word problems involving counting and show how they can be broken down 
into sums and products of simple numerical factors. Having read through the examples 
as passive readers, students next must assume the active role of devising solutions on 
their own to the exercises. The first exercises at the end of each section are similar 
to the examples discussed in the section. The later exercises, however, have little in 
common with the examples except that they require the same general types of logical 
reasoning, clever insights, and mathematical modeling. Facility with these three basic 
skills in problem solving, as much as an inventory of special techniques, is the key 
to success in most combinatorial applications. In sum, for many students, this will be 
the most challenging and most valuable chapter in this book. 

Subsequent chapters in the enumeration part of this book develop specialized 
theory and techniques that considerably simplify the solution of specific classes of 
counting problems. The usefulness of this theory is particularly evident after one has 
grappled with counting problems in this chapter using only first principles. Unfortu- 
nately, there is no such theory to assist in the solution of most real-world counting 
problems. 

In order to motivate our problem solving, we mention applications to probability 
and statistics, computer science, operations research, and other disciplines. However, 
the details of the counting problems arising from such applications often appear 
tedious if one is not actively working in the area of application. So instead, we will 
base many of the worked-out examples (and exercises) on recreational problems, such 
as poker probabilities. The solution of these problems requires the same mathematical 
skills used in more substantive applications. 
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The problem-solving skills mentioned above are building blocks in a more general 
strategy: asking the right questions about how to solve a problem. These questions 
range over a number of issues, such as what techniques to use, how to break the 
problem into manageable pieces, what unseen challenges are lying beneath the surface 
in the statement of the problem, what the first step is in solving the problem, and many 
more. For example, a critical question we repeatedly must ask at the start of each 
counting problem is: are the outcomes to be enumerated in this problem unordered 
sets (combinations), or ordered sets (sequences)? It is typically much harder to ask 
the right questions than it is to find the answers to these questions. 

When an instructor or textbook presents the solution to a new type of counting 
problem, the steps in the solution may not be too difficult to follow, but the hard part of 
the solution — knowing what the right steps are — is frequently never discussed. This 
text will try to give proper attention to asking the right questions as well as answering 
these questions. 

This section starts with two elementary but fundamental counting principles 
whose simplicity masks both their power and the ease with which they can be misused. 



The Addition Principle 

If there are rj different objects in the first set, r 2 different objects in the 

second set, . 

. , and r m different objects in the mth set, and if the different sets 

are disjoint, then the number of ways to select an object from one of the m sets 

is n + r 2 + - 



The Multiplication Principle 


Suppose a procedure can be broken into m successive (ordered) stages, 
with r\ different outcomes in the first stage, r 2 different outcomes in the second 
stage, . . . , and r m different outcomes in the with stage. If the number of outcomes 
at each stage is independent of the choices in previous stages and if the com- 
posite outcomes are all distinct, then the total procedure has r\ x r 2 x • • • x r m 
different composite outcomes. 


Remember that the addition principle requires disjoint sets of objects and the 
multiplication principle requires that the procedure break into ordered stages and 
that the composite outcomes be distinct. The validity of these two principles follows 
directly from the definitions of addition and multiplication of integers. That is, the 
sum a + b is the number of items resulting when a set of a items is added to a set of 
b items; and the product a x bis the number of sequences (A,B), when A can be any 
of a items and B can be any of b items. The two principles are standard mth-order 
extensions of these two binary operations. 
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Example 1: Rolling Dice 

Two dice are rolled, one green and one red. Each die has faces numbered 1 through 6. 

(a) How many different outcomes of this procedure are there? 

(b) What is the probability that there are no doubles (not the same value on both 
dice)? 

(a) The question one needs to ask to get started is: what is an outcome 
in this problem? An outcome is actually a composite outcome of two smaller 
outcomes — namely, the number appearing on the green die when it is rolled and 
the number appearing on the red die. 

To count the number of outcomes for a problem, it is essential to have a way 
to list outcomes on a piece of paper. In this case, a natural way to express the 
outcomes is ordered pairs of numbers, where the first number is the red die’s value 
and the second number the green die’s value. Then the collection of outcomes 
will be all 2-number sequences with each number between 1 and 6 (inclusive). 
Thus, there are 6 x 6 outcomes. 

(b) We calculate the probability of no doubles using the probability formula 
for an event: the size of the subset of outcomes producing the desired event 
divided by the number of all possible outcomes (see Appendix A. 3 for details). 
The denominator in the probability fraction is 36, the total number of outcomes 
determined in part (a). To get the numerator — the number of outcomes with no 
doubles — we again use the multiplication principle. The constraint “no doubles” 
must be recast as “the value of the green die is different from the value of the red 
die.” Once the red die is rolled, then there are just five permissible other values 
for the green die, independent of the particular value of the red die. So there 
are 6 x 5 = 30 outcomes with no doubles, and the probability of no doubles is 
30/36 = 5/6. ■ 


The number of no-doubles in Example 1 (b) could also be answered by subtracting 
the six outcomes of doubles from all 36 outcomes, yielding 36 — 6 = 30 outcomes 
with no doubles. 

Example 2: Arranging Books 

There are five different Spanish books, six different French books, and eight different 
Transylvanian books. How many ways are there to pick an (unordered) pair of two 
books not both in the same language? 

The outcomes are unordered pairs of books. “Unordered” means that there is not 
a first book in the pair, and so we cannot break the outcomes into a first stage (first 
book) and a second stage — that is, the multiplication principle does not apply. So we 
face the question of how we recast or decompose this problem in such a way that 
the multiplication or additional principle can be used. Typically when we face this 
challenge, we need to break the problem into smaller parts to which the principles 
apply. This raises the question of what the parts are into which we should break the 
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problem. Once we get the right parts, the solution to each part is usually easy to 
find. If one recognizes a similarity between the analysis of the current problem and a 
previously solved problem, then the right parts might be found by analogy. Without 
such help, the best strategy is to pick a special case of the problem that is easy to 
solve. Then solve another special case and look for a pattern for solving the other 
cases. 

One possible case in this problem is when the two books consist of a Spanish and a 
French book. This case has two stages: first, pick a Spanish book in five ways, and then 
pick a French book in six ways. The multiplication principle applies giving an answer 
of 5 x 6. It is not hard to see that the other two cases — (i) a Spanish and a Transylvanian 
book and (ii) a French and a Transylvanian book — will work out similarly with 
answers of 5 x 8 and 6x8, respectively. The outcomes in three cases are clearly 
disjoint, and so the addition principle can be used to add the numbers of outcomes 
in the three cases to get the total number of pairs of books: 5x6+5x8+6x8= 
30 + 40 + 48= 118. ■ 


The preceding example typifies a basic way of thinking in combinatorial problem 
solving: find a small enough special case for one to solve, then use the same solution 
method on the other cases, and add up the answers to all the special cases. There may 
be cleverer ways to solve the problem, but if we can reduce the original problem to 
a set of subproblems with which we are familiar, then we are less likely to make a 
mistake. 


Example 3: Sequences of Letters 

How many ways are there to form a three-letter sequence using the letters a, b, 
c, d, e, f (a) with repetition of letters allowed? (b) without repetition of any letter? 
(c) without repetition and containing the letter el (d) with repetition and containing el 

(a) With repetition, we have six choices for each successive letter in the sequence. 
So by the multiplication principle there are 6x6x6 = 216 three-letter 
sequences with repetition. 

(b) Without repetition, there are six choices for the first letter. For the second 
letter, there are five choices, corresponding to the five remaining letters (what- 
ever the first choice was). Similarly for the third letter, there are four choices. 
Thus there are 6x5x4 = 120 three-letter sequences without repetition. 

(c) It is often helpful to make a diagram displaying the positions in a sequence, 
even a sequence with just three letters: 


Such a diagram helps focus on choices involving the positions. Now we 
face a hard question in starting to solve this problem. How do we recast the 
constraint of at least one e in the sequence into a form that can be analyzed 
in terms of the addition and multiplication principles? There are typically 
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several different ways to analyze a new constraint like this. Here we employ 
the case-by-case approach that we used in Example 2. One appealing version 
of this approach is to consider the different places where there must be an e. 
As the following diagrams show, the e could be in the first position or the 
second position or the third position: 

e_ _ _e _ e_ 

In each diagram, there are five choices for which of the other five let- 
ters (excluding e) goes in the first remaining position and four choices for 
which of the remaining four letters goes in the other position. Thus there are 
5x4+5x4 + 5x4 = 60 three-letter sequences with e. Another approach 
to solving this problem is given in Exercise 19. 

(d) Let us try the approach used in part (c) when repetition is allowed. As before, 
there are three choices for e’s position. For any of these choices for e’s 
position, there are 6x6 = 36 choices for the other two positions, since e and 
the other letters can appear more than once. But the answer of 36 + 36 + 36 = 
108 is not correct. 

The addition principle has been violated because the outcomes in the 
three cases of where the e must be are not distinct. Consider the sequence 

e_ £ e_ 

It was generated two cases: once when e was put in the first position followed 
by c e as one of the 36 choices for the latter two positions, and a second time 
when e was put in the last position with e c in the first two positions. 

We must use an approach for breaking the problem into parts that ensures 
distinct outcomes. Let us decompose the problem into disjoint cases based 
on where the first e in the sequence occurs. First suppose the first e is in the 
first position: 


e •_ 

Then there are six choices (including e ) for the second and for the third 
positions — 6 x 6 ways. 

Next suppose the first e is in the second position: 

_ e_ _ 

noe 

Then there are five choices for the first position (cannot be e) and six choices 
for the last position — 5 x 6 ways. Finally, let the first (and only) e be in the last 
position: 


_ e_ 

no e no e 

There are five choices each for the first two positions — 5 x 5 ways. The 
correct answer is thus (6 x 6) + (5 x 6) + (5 x 5) = 91. ■ 
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The hardest part about solving most counting problems is finding a structure 
in the problem that allows it to be broken into subcases or stages. In other words, 
the difficulty is in “getting started.” At the same time, one must be sure that the 
decomposition into cases or stages generates outcomes that are all distinct. 

Many counting problems require their own special insights. For such problems, 
knowing the solution to problem A is typically of little help in solving problem B. This 
skill cannot be acquired in reading textbook examples. It is only gained by working 
many problems. 

The following example illustrates this type of special insight in combinatorial 
problem solving. 


Example 4: Nonempty Collections 

How many nonempty different collections can be formed from five (identical) apples 
and eight (identical) oranges? 

Readers with some experience in combinatorial problem solving may want to 
break the problem into subcases based on the number of objects in the collections. 
Any one of these subcases can be counted quite easily, but there are 13 possible 
subcases; that is, collections can have 1 or 2 or ... up to 13 pieces of fruit. 

In counting different possibilities, we must concentrate on what makes one col- 
lection different from another collection. The answer is, the number of apples and/or 
the number of oranges will be different in different collections. Then we can charac- 
terize any collection by a pair of integers (a, o), where a is the number of apples and 
o is the number of oranges. 

Now the number of collections is easy to count. There are six possible values 
for a (including 0) — 0, 1, 2, 3, 4, 5 — and nine possible values for o. Together there 
are 6 x 9 = 54 different collections. (Note that we multiply 6 and 9, not add them, 
because a collection combines any number of apples and any number of oranges; we 
add if we want to count the ways to get some amount of apples or some amount of 
oranges, but not both.) 

Since the problem asked for nonempty collections and one of the possibilities 
allowed was (0,0), the desired answer is 54 — 1 = 53. ■ 


Here is one piece of advice to consider when one is stuck and cannot get started 
with a problem. Try writing down in a systematic fashion some (a dozen or so) of the 
possible outcomes. In listing outcomes, one should start to see a pattern emerge. Think 
of the list as being part of one particular subcase. Then ask: how many outcomes would 
the list need to include to complete that subcase? Next ask what other (hopefully 
similar) subcases need to be counted. Once a first subcase has been successfully 
analyzed, other subcases are usually easier. 

We summarize the two key facets we have encountered in combinatorial problem- 
solving. First, we must find a way to recast the constraints in a problem so that some 
combination of the addition and multiplication principles can be applied. Second, to 
use these principles we must find a way to break the problem into pieces or stages, and 
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be sure that the outcomes in the different pieces are distinct. As noted at the outset, 
these concerns are instances of the general challenge of asking the right questions in 
problem-solving. 


5.1 EXERCISES 



' Summary of Exercises The early exercises are straightforward. 
Then the exercises become more challenging and require analysis that will be different 
for each problem. For the harder problems, readers must devise their own method of 
solution rather than mimic a method used in one of the text’s examples. All exercises 
should be read carefully two times to avoid misinterpretation. 

Some problems include possible answers for comment. Try to infer from each 
expression the reasoning that would have generated such an answer. If the answer is 
wrong, point out the mistake in the reasoning. 

The word “between” is always used in the inclusive sense; that is, “integers 
between 0 and 50” means 0, 1,2,..., 49, 50. The probability of a particular outcome 
is the number of such particular outcomes divided by the number of all outcomes. 
See Appendix A. 3 for more about probability. 

1. (a) How many ways are there to pick a sequence of two different letters of the 

alphabet that appear in the word CRAB? In STATISTICS? 

(b) How many ways are there to pick first a vowel and then a consonant from 
CRAB? From STATISTICS? 

2. (a) How many integers are there between 0 and 50 (inclusive)? 

(b) How many of these integers are divisible by 2? 

(c) How many (unordered) pairs of these integers are there whose difference 


is 5? 


3. A store carries eight styles of pants. For each style, there are 12 different possible 
waist sizes, five pants lengths, and four color choices. How many different types 
of pants could the store have? 

4. How many different sequences of heads and tails are possible if a coin is flipped 
100 times? Using the fact that 2 10 = 1024 ~ 1000 = 10 * 1 2 3 4 5 6 7 , give your answer in 
terms of an (approximate) power of 10. 

5. How many six-letter “words” (sequence of any six letters with repetition) are 
there? How many with no repeated letters? 

6. How many ways are there to pick a man and a woman who are not husband and 
wife from a group of n married couples? 

7. Given 10 different English books, six different French books, and four different 
German books, 

(a) How many ways are there to select one book? 

(b) How many ways are there to select three books, one of each language? 
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(c) How many ways are there to make a row of three books in which exactly one 
language is missing (the order of the three books makes a difference)? 

8. There are seven different roads between town A and town B, four different roads 
between town B and town C, and two different roads between town A and town C. 

(a) How many different routes are there from A to C altogether? 

(b) How many different routes are there from A to C and back (any road can be 
used once in each direction)? 

(c) How many different routes are there from A to C and back in part (b) that 
visit B at least once? 

(d) How many different routes are there from A to C and back in part (b) that do 
not use any road twice? 

9. How many ways are there to pick two different cards from a standard 52-card 
deck such that 

(a) The first card is an Ace and the second card is not a Queen? 

(b) The first card is a spade and the second card is not a Queen? {Hint: Watch 
out for the Queen of spades.) 

10. How many nonempty collections of letters can be formed from four As and 
eight Bs? 

11. How many ways are there to roll two distinct dice to yield a sum evenly divisible 
by 3? 

12. How many six-letter “words” (sequences of letters with repetition) are there in 
which the first and last letter are vowels? In which vowels appear only (if at all) 
as the first and last letter? 

Comment on these possible answers to the second part: (a) 5 2 21 4 , (b) 5 2 26 4 , 

(c) 21 2 26 4 , and (d) 26 6 - 21 2 26 4 . 

13. (a) How many different six-digit numbers are there (leading zeros, e.g., 00174, 

not allowed)? 

(b) How many even six-digit numbers are there? 

(c) How many six-digit numbers are there with exactly one 3? 

(d) How many six-digit palindromic numbers (numbers that are the same when 
the order of their digits is inverted, e.g., 137731) are there? 

14. How many different numbers can be formed by various arrangements of the six 
digits 1, 1, 1, 1,2, 3? 

15. What is the probability that the top two cards in a shuffled deck do not form a 
pair? 

16. (a) How many different outcomes are possible when a pair of dice, one red and 

one white, are rolled two successive times? 

(b) What is the probability that each die shows the same value on the second roll 
as on the first roll? 
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(c) What is the probability that the sum of the two dice is the same on both rolls? 

(d) What is the probability that the sum of the two dice is greater on the second 
roll? 

17. A rumor is spread randomly among a group of 10 people by successively having 
one person call someone, who calls someone, and so on. A person can pass the 
rumor on to anyone except the individual who just called. 

(a) By how many different paths can a rumor travel through the group in three 
calls? In n calls? 

(b) What is the probability that if A starts the rumor, A receives the third calls? 

(c) What is the probability that if A does not start the rumor, A receives the third 
call? 

18. (a) How many different license plates involving three letters and two digits are 

there if the three letters appear together either at the beginning or end of the 
license? 

(b) How many license plates involving one, two, or three letters and one, 
two, or three digits are there if the letters must appear in a consecutive 
grouping? 

19. Re-solve the problem in Example 3 of counting the number of three-letter se- 
quences without repetition using a , b, c, d, e, f that have an e by first counting 
the number with no e. 

20. What is the probability that the sum of two randomly chosen integers between 
20 and 40 inclusive is even (the possibility of the two integers being equal is 
allowed)? 

Comment on the answers (a) 1/2, (b) 11/21, and (c) 11 2 /21 2 . 

21. How many three-letter sequences without repeated letters can be made using 
a, b, c, d, e, f in which either e or / (or both) is used? 

Comment on the answers (a) 3 x 2 x 4 x 3, (b) 3 x 2 x 5 x 4, (c) 3 x 2 x 
4x4 — 3x2x4, and (d) 6x5x4 — 4x3x2. 

22. How many ternary (0, 1, 2) sequences of length 10 are there without any pair of 
consecutive digits the same? 

23. How many integers between 1 ,000 and 10,000 are there with (make sure to avoid 
sequences of digits with leading 0s): 

(a) Distinct digits? 

(b) Repetition of digits allowed but with no 2 or 4? 

(c) Distinct digits and at least one of 2 and 4 must appear? 

24. How many 1 2-digit decimal sequences are there that start and end with a sequence 
of at least two 3s? 

25. How many sequences of length 5 can be formed using the digits 0, 1,2, . . . , 9 
with the property that exactly two of the 10 digits appear, e.g., 05550? 
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26 . What is the probability that an integer between 0 and 9,999 has exactly one 8 
and one 9? 

Comment on the answers (a) 4x3/10 4 , (b) 4 x 3 x 8 x 7/1 0 4 , and 
(c) 2 x 4 x 3 x 8 2 /10 4 . 

27 . How many different five-letter sequences can be made using the letters A, B, C, 
D with repetition such that the sequence does not include the word BAD — that 
is, sequences such as A BAD D are excluded. 

28 . (a) How many election outcomes are possible with 20 people each voting for 

one of seven candidates (the outcome includes not just the totals but also 
who voted for each candidate)? 

(b) How many election outcomes are possible if only one person votes for can- 
didate A and only one person votes for candidate D? 

29. There are 15 different apples and 10 different pears. How many ways are there 
for Jack to pick an apple or a pear and then for Jill to pick an apple and a pear? 

30 . How many times is the digit 5 written when listing all numbers from 1 to 100,000? 

Comment on the answers (a) 4, (b) 5 x 10 4 , and (c) 1 + 10 + 100 + 1000. 

31 . How many times is “25” written when listing all numbers from 1 to 100,000? 
(This is an extension of the previous exercise.) 

32 . What is the probability that if one letter is chosen at random from the word 
RECURRENCE and one letter is chosen from RELATION, the two letters are 
the same? 

33. How many four-digit numbers are there formed from the digits 1, 2, 3, 4, 5 (with 
possible repetition) that are evenly divisible by 4? 

34 . How many nonempty collections of letters can be formed from n As, n Bs, n Cs 
and n Ds? 

35. There are 50 cards numbered from 1 to 50. Two different cards are chosen at 
random. What is the probability that one number is twice the other number? 

36 . If two different integers between 1 and 100 inclusive are chosen at random, what 
is the probability that the difference of the two numbers is 15? 

37. If three distinct dice are rolled, what is the probability that the highest value is 
twice the smallest value? 

38 . How many different numbers can be formed by the product of two or more of 
the numbers 3, 4, 4, 5, 5, 6, 7, 7, 7? 

39. There are 10 different people at a party. How many ways are there to pair them 
off into a collection of five pairings? 

40 . A chain letter is sent to five people in the first week of the year. The next week 
each person who received a letter sends letters to five new people, and so on. 
How many people have received letters in the first five weeks? 

41 . How many ways are there to place two identical rooks in a common row or 
column of an 8 x 8 chessboard? an n x m chessboard? 
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42 . How many ways are there to place two identical kings on an 8 x 8 chessboard so 
that the kings are not in adjacent squares? on an n x m chessboard? 

43 . How many ways are there to place two identical queens on an 8 x 8 chessboard 
so that the queens are not in a common row, column, or diagonal? 

44 . How many different positive integers can be obtained as a sum of two or more 
of the numbers 1, 3, 5, 10, 20, 50, 82? 

45 . How many ways are there for a man to invite some (nonempty) subset of his 10 
friends to dinner? 

46 . How many different rectangles can be drawn on an 8 x 8 chessboard (the rectan- 
gles could have sides of length 1 through 8; two rectangles are different if they 
contain different subsets of individual squares)? 

47 . How many ways are there to place a red checker and a black checker on two 
black squares of a checkerboard so that the red checker can jump over the black 
checker? (A checker jumps on the diagonal from in front to behind.) 

48 . Use induction to verify the following formally: 

(a) The addition principle 

(b) The multiplication principle 

49 . On the real line, place n white pegs at positions 1,2, . ... n and n blue pegs at 
positions — 1, —2, . . . , — n (0 is open). Whites move only to the left, blues to the 
right. When beside an open position, a peg may move one unit to occupy that 
position (provided it is in the required direction). If a peg of one color is in front 
of a peg of the other color that is followed by an open position (in the required 
direction), a peg may jump two units to the open position (the jumped peg is not 
removed). By a sequence of these two types of moves (not necessarily alternating 
between white and blue pegs), one seeks to get the positions of the white and 
blue pegs interchanged. (See the article on this game in Mathematics Teacher, 
January 1982.) 

(a) Play this game for n — 3 and n= 4. 

(b) Use a combinatorial argument to show that in general, n 2 + 2 n moves (unit 


steps and jumps) are required to complete the game. 



5.2 SIMPLE ARRANGEMENTS AND SELECTIONS 

A permutation of n distinct objects is an arrangement, or ordering, of the n objects. 
An /"-permutation of n distinct objects is an arrangement using r of the n objects. An 
/•-combination of n distinct objects is an unordered selection, or subset, of r out of 
the n objects. We use P(n, r ) and C(n,r) to denote the number of r -permutations and 
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/•-combinations, respectively, of a set of n objects. From the multiplication principle 
we obtain 


Pin, 2) = n(n — 1), Pin, 3) = n(n — l)(n — 2), 

P(n, n) — n(n — 1 )(n — 2) x ■ ■ ■ x 3 x 2 x 1 

In enumerating all permutations of n objects, we have n choices for the first 
position in the arrangement, n — 1 choices (the n — 1 remaining objects) for the second 
position, . . . , and finally one choice for the last position. Using the notation n\ = 
n{n — l )(/2 — 2) ■ ■ • x 3 x 2 x 1 (n ! is said “« factorial”), we have the formulas 

P(n, n) = n\ 


and 

nl 

Pin, r) = nin— 1 )(n — 2) x • • • x [// — (/- — 1 )] = 

in — r)l 

Our formula for Pin, r) can be used to derive a formula for Cin, r). All /--permutations 
of n objects can be generated by first picking any /-combination of the n objects and 
then arranging these r objects in any order. Thus Pin, r) = Cin, r) x Pi r, r), and 
solving for Cin, r) we have 

Pin, r) n\/in — r)\ n\ 

Cin, /•) = = = 

Pir, r) r\ r!(/z — r)\ 

The numbers Cin, r) are frequently called binomial coefficients because of their role 
in the binomial expansion (x + >')"■ We study the binomial expansion and identities 
involving binomial coefficients in Section 5.5. It is common practice to write the 
expression Cin, r) as (") and to say “n choose r" [we usually write Cin, r) in this 
book]. Note that Cin, r) = Cin, n - r), since the number of ways to pick a subset of r 
out of n distinct objects equals the number of ways to throw away a subset of n-r. 

The rest of this section is a continuation of the combinatorial problem-solving 
examples in the previous section. Along with the introduction of permutations and 
combinations, these examples involve more complicated situations. With many new 
constraints to keep straight, readers may find themselves becoming confused about 
even basic issues such as when to add and when to multiply. One must think very 
carefully about each step in the analysis of a counting problem. 

There are two ways to analyze arrangement problems. We have the option of 
either picking which item goes in the first position, then which item goes in the 
second position, and so on, or picking which position to choose for the first item, 
which position to choose for the second item, and so on. 


Example 1: Ranking Wizards 

How many ways are there to rank n candidates for the job of chief wizard? If the 
ranking is made at random (each ranking is equally likely), what is the probability 
that the fifth candidate, Gandalf, is in second place? 
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A ranking is simply an arrangement, or permutation, of the n candidates. So there 
are n\ rankings. Intuitively, a random ranking should randomly position Gandalf, and 
so we would expect that he has probability 1 /n of being second, or being in any given 
position. Equivalently, each of the n candidates should have the same probability 
of being in second place. Formally, we calculate the probability of Gandalf being 
second using the probability formula for an event: the subset of outcomes producing 
the desired event divided by all possible outcomes (see Appendix A. 3 for details). 

no. of rankings with Gandalf second 

Prob(Gandalf second) = 

total no. of rankings 

Now comes the hard question. How do we count outcomes where Gandalf is ranked 
second? The answer lies in the observation preceding this example about the two 
ways to analyze arrangement problems: (i) picking an item for the first position, then 
the second position, and so on; or (ii) picking a position for the first item, then for the 
second item, and so on. We need to use the second approach and make Gandalf the 
first wizard to be given a position. 

We must put Gandalf in second place — 1 way — and then put the remaining 
wizards in the remaining n — 1 places — (n — 1)! ways. So Gandalf is second in 
(n — 1)! rankings, and Prob(Gandalf second) = in — \)\/n\ — I / n, as expected. ■ 


To count arrangements where a particular position is fixed — in the preceding 
example, that Gandalf is in second place — we do not count the ways to make Gandalf 
second (there is only one way to put Gandalf in second place) but rather count all the 
ways to arrange the remaining wizards in the remaining positions. We count what has 
not been constrained. 


Example 2: Arrangements with Repeated Letters 

How many ways are there to arrange the seven letters in the word SYSTEMS? In how 
many of these arrangements do the three Ss appear consecutively? 

The first question we must ask is : how do we modify the formula for arrangements 
of distinct items to incorporate the situation where one item may be repeated. In the 
previous example, we handled the new constraint, Gandalf ranked second, first and 
then arranged other wizards. In this example, we take the opposite approach and delay 
arranging the multiple Ss until the end. 

We first pick the positions that the other four letters E, M, T, Y will occupy in the 
seven-letter arrangement, and then the three Ss will fill the remaining three positions 
in one way. There are seven possible positions for E, six for M, five for T, and four 
for Y. Thus there are P(J, 4) = 7!/3! = 840 arrangements. (A general formula for 
counting arrangements with repeated objects is given in the next section.) 

Here again we counted arrangements in terms of where successive items should 
be placed rather than what should go in successive positions. 

Next we consider the case where the three Ss appear consecutively, that is, the 
three Ss are all side-by-side. The “trick” for handling this new consecutivity constraint 
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is to realize that when the three Ss are grouped together they now become a single 
composite letter. So the problem reduces to arranging the five distinct letters, Y, T, E, 
M and SSS (treated as a single letter), which can be done in 5 ! = 120 ways. 

Another way to look at this problem is to think of temporarily setting aside two 
of the Ss, arranging the five remaining letters, Y, T, E, M, S, in 5! ways, and then in 
each resulting arrangement inserting beside the S the other two Ss. ■ 


Example 3: Binary Sequences 

How many different 8-digit binary sequences are there with six Is and two Os? 

A good starting question is: what distinguishes one 8-digit binary sequence with 
six Is from another such sequence? The answer is the positions of the six Is (or the 
two Os). Though this problem initially reads as an arrangement problem, what must 
be counted is the different possible placements of the six Is, that is, different possible 
subsets of six of the eight positions in the binary sequence. (A sequence with Is in 
positions 1, 2, 3, 4, 5, 6 is the same as a sequence with Is in positions 6, 5, 2, 3, 4, 
1 — the order does not matter, just the collection of positions involved.) So the answer 
is C(8, 6) = 28. 

We could alternatively have focused on picking a subset of two of the eight 
positions for Os. ■ 


Example 4: Poker Probabilities 

(a) How many 5-card hands (subsets) can be formed from a standard 52-card deck? 

(b) If a 5-card hand is chosen at random, what is the probability of obtaining a flush 
(all five cards in the hand are in the same suit)? 

(c) What is the probability of obtaining three, but not four. Aces? 

(a) A 5-card hand is a subset of five cards chosen from the 52 cards in a deck, 
and so there are C(52, 5) = 52!/(47!5!) = 2,598,960 different 5-card hands. 

(b) To find the probability of a flush, we need to find the number of 5-card 
subsets with all cards of the same suit. There are four suits, and a subset of five 
cards from the 13 cards in a given suit can be chosen in C( 13, 5) = 13!/(5!8!) = 
1287 ways. So there are 4 x 1287 = 5148 flushes, and 

5148 

Prob(5-card hand is a flush) = = 0.00198 (^ 0.2%) 

2,598,960 

(c) To count the number of hands with exactly three Aces, we must pick three 
of the four Aces — done in C( 4, 3) = 4 ways — and then fill out the hand with two 
cards chosen from the 48 non-Ace cards — done in C(48, 2) = 1128 ways. So 
there are 4 x 1 128 = 4512 hands with exactly three Aces, and 

4512 

2,598,960 


Prob(5-card hand has exactly three Aces) = 


= .00174 ■ 
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Note that to count hands with three Aces in Example 4, we implicitly used the 
multiplication principle to multiply the ways to pick three Aces times the ways to fill 
out the hand with two non-Ace cards. However, a hand is an unordered collection, 
and by ordering it into two parts, we might generate two outcomes that are really the 
same set, violating the distinctness condition of the multiplication principle. In this 
problem, hands could safely be decomposed into an Aces part and a non-Aces part 
because the types of cards in the two parts were different. The next example illustrates 
how this disjointness condition can be violated. 

Example 5: Forming Committees 

A committee of k people is to be chosen from a set of seven women and four men. 
How many ways are there to form the committee if 

(a) The committee consists of three women and two men? 

(b) The committee can be any positive size but must have equal numbers of women 
and men? 

(c) The committee has four people and one of them must be Mr. Baggins? 

(d) The committee has four people and at least two are women? 

(e) The committee has four people, two of each sex, and Mr. and Mrs. Baggins cannot 
both be on the committee? 

(a) Applying the multiplication principle to disjoint collections, we can count 
all committees of three women and two men by composing all subsets of three 
women with all subsets of two men, C(7, 3) x C(4, 2) = 35 x 6 = 210 ways. 

(b) To count the possible subsets of women and men of equal size on the 
committee, we must know definite sizes of the subsets. That is, we must break 
the problem into the four disjoint subcases: one woman and one man, two each, 
three each, and four each (there are only four men available). So the total num- 
ber is the sum of the possibilities for these four subcases, [C(7, 1) x C( 4, 1)] + 
[C(7, 2) x C(4, 2)] + [C(7, 3) x C(4, 3)] + [C(7, 4) x C(4, 4)] = 7x4 + 21x 
6+ 35 x 4 + 35 x 1 = 329. 

(c) If Mr. Baggins must be on the committee, this simply means that the 
problem reduces to picking three other people on the 4-person committee from 
the remaining 10 people (seven women and three men). Observe the similarity of 
this constraint with the Gandalf-second constraint in Example 1 . In both cases, 
we need to focus on the ways of arranging or selecting the other people. So far 
we have made all selections from the set of men or the set of women. Now the 
other three people must be chosen from the set of all remaining people, and so the 
answer is C(10, 3) = 120. (It is easy to get in a mindset where one uses the same 
sets in the current problem that one used in the previous problem, here the set of 
women and the set of men — problem solvers must always be alert to this trap.) 

(d) One approach is to pick two women first, C (7, 2) = 2 1 ways, and then pick 
any two of the remaining set of nine people (five women and four men). However, 
counting all committees in this fashion counts some outcomes more than once. 


194 Chapter 5 General Counting Methods for Arrangements and Selections 


since any woman in one of these committees could be chosen as either one of the 
first two women or one of the two remaining people. For example, if IV', denotes 
the ith woman and M, the zth man, then ( IV i , W3) composed with the two remaining 
people ( W 2 , M3) yields the same set as ( Wi , W 2 ) composed with (W3, M3). 

A correct solution to this problem must use a subcase approach, as in part 
(b). That is, break the problem into three subcases that specify exactly how 
many women and how many men are on the committee: two women and two 
men, three women and one man, and four women. The answer is thus [C( 7, 2) x 
C(4, 2)] + [C(7,3)x C( 4, 1)] + C(7, 4) = 21 x 6 + 35 x 4 + 35 = 301. 

(e) We need to recast the condition “Mr. and Mrs. Baggins cannot both be on 
the committee” into several simpler subcases in which we know exactly which 
of Mr. and Mrs. Baggins is on the committee. Note that the possibility of neither 
Mr. nor Mrs. Baggins’ being on the committee satisfies the given condition. 
There are three subcases to consider. 

The first subcase is that Mrs. Baggins is on the committee and Mr. Baggins 
is not. Then one more woman must be chosen from the remaining six women and 
two more men must be chosen from the remaining three men (Mr. Baggins is ex- 
cluded). This can be done inC(6, 1)xC(3,2) = 6x3 = 18 ways. The other two 
cases are: Mrs. Baggins is off and Mr. Baggins is on, which by a similar argument 
yields C( 6, 2) x C( 3, 1) = 15 x 3 = 45 ways; and neither is on the committee, 
C( 6, 2) x C(3, 2) = 15 x 3 = 45 ways. The total answer is 18 + 45 + 45 = 108. 

An easier solution to this problem can be obtained by taking a complementary 
approach. We can consider all C(7, 2) x C( 4, 2) 2-women-2-men committees 
and then subtract the forbidden committees that contain both Bagginses. The 
forbidden committees are formed by picking one more woman and one more man 
to join Mr. and Mrs. Baggins — done in C( 6, 1) x C( 3, 1) ways. This approach 
yields a simpler formula for the same answer: 21x6 — 6x3 = 108. ■ 

Parts (b), (d), and (e) of Example 5 illustrate an important point. When counting 
the ways to pick elements in a given subset, as a part of a more complex problem, 
one needs to specify the number of elements in the subset. If the size of the subset 
can vary, then one must break the problem into subcases so that the size of the subset 
is a fixed number in each subcase. 

The mistake of counting the same outcome twice, which arose in part (d) of 
Example 5, arises in many guises. The following principle should help the reader 
avoid this problem. 

The Set Composition Principle 

Suppose a set of distinct objects is being enumerated using the multiplication principle, 
multiplying the number of ways to form some first part of the set by the number of 
ways to form a second part (for a given first part). The possible members of the first 
and second parts must be disjoint. Another way to express this condition is: given any 
set S thus constructed, one must be able to tell uniquely which elements of S are in 
the first part of S and which elements are in the second part. 
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In Example 5(a), the part W of three women and the part M of two men are disjoint, 
and so by the set composition principle the number of committees with three women 
and two men is the size of W times the size of M. In the “committee with at least two 
women” problem in Example 5(d), the method of choosing two of the seven women 
first and then picking any remaining two people (women or men) violates the Set Com- 
position Principle because the members of the two parts are not disjoint — any woman 
could be in either the first or second part. For example, in the committee { W), Wj, W3, 
M3} it is impossible to say which two women are chosen in the first part. The scheme 
of picking two women and then any two remaining people generates the committee 
{W \ , W 2 , W3, M3} in C(3, 2) different ways — namely, all compositions of (i) two of the 
three women W \ , W2, W3 ; with (ii) M3 plus the remaining woman in W 1 , VY , W3 not 
chosen in (i). 

In Example 2 about arrangements of the letters of SYSTEMS, we dealt with 
the constraint that certain letters had to be grouped together in the arrangement. In 
Example 2, it was three consecutive Ss. In the following continuation of that example, 
we consider the constraint of requiring a particular letter to appear somewhere 
before another letter in the arrangement. We also show how to grapple with the two 
constraints simultaneously. ■ 


Example 2: (continued) Arrangements with Repetitions 

How many arrangements of the seven letters in the word SYSTEMS have the E 
occurring somewhere before the M? How many arrangements have the E somewhere 
before the M and the three Ss grouped consecutively? 

The key to the constraint of E being somewhere before the M (not necessarily 
immediately before the M) is to focus on the pair of positions where E and M will go. 
Thus, we start by picking which of the two out of the seven positions in an arrangement 
are where the E and M will go — C(7, 2) = 21 ways — and then we put E in the first of 
this pair of positions and M in the second one. Now we fill in the five other positions 
in the arrangement by picking a position for the Y and the T — P (5, 2) = 5 x 4 = 20 
ways — and then putting the three Ss in the three remaining positions. The answer is 
thus 21 x 20 = 420. 

While it may sound scary to deal with the two constraints at once, it often turns 
out to be less hard than expected if one handles the constraints in the right order. If 
we first pick the pair of the positions for the E and the M, things get messy for the 
consecutivity constraint because the different placements of the E and M will impact 
differently the positions in the arrangement where there is enough room to place three 
consecutive Ss. 

So we try dealing with the consecutive Ss first. This simply requires that we “glue” 
the three Ss together into one composite letter and consider the reduced problem of 
arranging five letters, Y, T, E, M, (SSS). Now we turn to the other constraint and pick 
the pair of positions, out of the five new positions, for the E and the M — C(5, 2) = 
10 ways — with the E going in the first of the two chosen positions. Then we arrange 
the Y, T and (SSS) in the remaining three positions in 3 ! = 6 ways. So the answer is 
10 x 6 = 60. ■ 
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The following type of counting problem arises frequently in quality-control 
problems. 

Example 6: Counting Defective Products 

A manufacturing plant produces ovens. At the last stage, an inspector marks the ovens 
A (acceptable) or U (unacceptable). How many different sequences of 15 As and Us 
are possible in which the third U appears as the twelfth letter in the sequence? 

This problem is a binary sequence problem similar to Example 5 except now the 
elements are A and U, rather than 0 and 1 . If the third U appears at the twelfth letter in 
the sequence, then the subsequence composed of the first 1 1 letters must have exactly 
two Us (and nine As). Following the reasoning in Example 5, there are C(1 1, 2) = 55 
possible sequences for the first 1 1 letters. There is one way to pick the twelfth letter — 
it is specified to be U. The remaining three letters in the sequence can be either A or 
U — 2 3 = 8 possibilities. All together, there are 55 x 1 x 8 = 440 sequences. ■ 


Example 7: Probability of Repeated Digits 

What is the probability that a 4-digit campus telephone number has one or more 
repeated digits? 

There are 10 4 = 10,000 different 4-digit phone numbers. We break the problem 
of counting 4-digit phone numbers with repeated digits into four different cases of 
repetitions: 

(a) All four digits are the same. 

(b) three digits are the same, the other is different. 

(c) two digits are the same, the other two digits are also the same (e.g., 2828). 

(d) two digits are the same, the other two digits are each different (e.g., 5105). 

(a) There are 10 numbers formed by repeating one of the 10 digits four times. 

(b) One way to decompose the process of generating these case-(b) numbers 
is as follows (several other decompositions are possible). First pick which digit 
appears once — 10 choices — then where it occurs in the 4-digit number — four 
choices — and finally which other digit appears in the other three positions — nine 
choices. This yields 10 x 4 x 9 = 360 numbers. 

(c) First pick which two digits are each to appear twice — C(10, 2) = 45 
choices — and then how to arrange these four digits: pick which two positions are 
used by, say, the smaller of the digits — C( 4, 2) = six ways. This yields 45 x 6 = 
270 numbers. 

(d) First pick which pair of digits appear once — C (10,2) = 45 choices — then 
pick a position for, say, the smaller of these two digits and a position for the larger 
digit — 4 x 3 = 12 choices — and finally pick which other digit appears in the 
remaining two positions — eight choices. This yields 45 x 12 x 8 = 4320 numbers. 

In sum, there are 10 + 360 + 270 + 4320 = 4960 4-digit phone numbers with a 
repeated digit. The probability of a repeated digit is thus 4960/10,000 = 0.496 ~ 0.5. 
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We note that there happens to be a simpler way, using the complementary set, 
to count phone numbers with repeated digits: count numbers with no repeated dig- 
its. These are just the P( 1 0. 4) = 5040 four-permutations of the 10 digits. So the 
remaining repeated-digit numbers amount to 10,000 — 5040 = 4960. ■ 

One point of caution about cases (c) and (d) where two different digits both occur 
once or both occur twice. In case (d), we pick the two digits occurring once as an 
unordered pair in C(10, 2) ways and arrange those digits (and then pick the digit 
to go in the remaining two positions) rather than pick a first digit, position it, then 
pick a second digit, position it (and then pick the digit to go in the remaining two 
positions) — 10 x4x9x3x8 ways. In this latter (wrong) approach, we cannot tell 
for a telephone number such as 2529 whether the 5 was chosen first and put in the 
second position and then the 9 chosen next and put in the fourth position, or whether 
the 9 was chosen first and put in the fourth position and then the 5 chosen next and put 
in the second position. The disjointness requirement of the multiplication principle is 
being violated and each outcome in case (d) would be counted twice. 

Example 8: Voter Power [Optional] 

We consider a way of measuring the influence of different players in weighted voting. 
Suppose that in a 5-person regional council there are three representatives from small 
towns, call them a, b, c, who each cast one vote, and there are two representatives 
from large towns, call them D, E, who each cast two votes. With a total of seven votes 
cast, it takes four votes (a majority of votes) in favor of legislation to enact it. Suppose 
that in forming a coalition to vote for some legislation, the people join the coalition in 
order (an arrangement of the people). The pivotal person in a coalition arrangement 
is the person whose vote brings the number of votes in the coalition up to four. For 
example, in the coalition arrangement bDcaE, the pivotal person is c. A measure of the 
“power” of a person p in the council is the fraction of coalition arrangements in which 
p is the pivotal person. This measure of power is called the Shapley-Shubik index. 

Determine the Shapley-Shubik index of person a and person D in this council — 
that is, determine the fraction of all coalition arrangements in which a and D, respec- 
tively, are pivotal. (By symmetry, the other 1-vote people b and c will have the same 
index as a, and similarly E will have the same index as D .) 

If a is pivotal in a coalition arrangement, then people with (exactly) three votes 
must precede a in the arrangement and people with three votes must follow a. Since 
there are only two other 1-vote people, the three votes preceding a must come from 
one 1 -vote person — b or c — and one 2-vote person — D or E. Then the beginning of 
the coalition can be formed in 2 (choice of 1-vote person) x 2 (choice of 2-vote 
person) x 2 (whether 1-vote or 2-vote person goes first) = 8 ways. The remain- 
ing 1-vote and remaining 2-vote person will follow a, with two ways to arrange 
them. In total there are 8x2= 16 coalition arrangements in which a is pivotal. 
There are 5! = 120 arrangements in all, and so the Shapley-Shubik index of a is 
16/120 = 4/30. 

If D is pivotal in a coalition arrangement, there can be people with two or three 
votes preceding D. Suppose there are two votes before D and three votes after D. 
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Either the arrangement starts with two of the three 1 -vote people — arranged 3x2 
ways — then D, followed by the other 1 -vote person and E in either order — two ways — 
or the arrangement starts with E, then D, followed by an arrangement of the three 
1-vote people — 3! ways. In total, there are (3 x 2 x 2) + 3! = 18 arrangements with 
two votes before D and three votes after D. By interchanging the people before D with 
the people after D in these arrangements, we obtain the arrangements with of three 
votes before D and two votes after D. So there are 18 of the latter arrangements. In 
total, there are 18+18 = 36 arrangements in which D is pivotal, and so D’s Shapley- 
Shubik index is 36/120 = 9/30. 

Observe that a 2-vote person has an index 21 times the size of a 1-vote person. ■ 


We close this section by noting that for large values of n, n \ can be approximated 
by the number s n = *J2 7tn(n/e) n , where e is Euler’s constant ( e = 2.718 . . . ). This 
approximation is due to Stirling and its derivation is given in most advanced calculus 
texts (see Buck [1]). The error \n ! — s n | increases as n increases, but the relative error 
| n ! — s„\/s n is always less than 1/1 In. The following table gives some sample values 
of n\, s n , and |n! — s n \/s„. 


n 

n! 

Sn 

\n\ -s„\/s n 

i 

i 

.922 

.085 

2 

2 

1.919 

.042 

5 

120 

118.02 

.017 

10 

3,628,800 

3,598,600 

.008 

20 

2.433 x 10 18 

2.423 x 10 18 

.004 

100 

9.333 x 10 157 

9.328 x 10 157 

.0005 



5.2 EXERCISES 


Summary of Exercises As in the previous section, most of these 
exercises require individual analysis, different for each problem. Remember to read 
problems carefully to avoid misinterpretation. Pay special attention to whether a 
problem involves arrangements or subsets. Note that the problems assume that people 
are distinct objects (no identical people). 


1. How many ways are there to arrange the cards in a 52-card deck? 

2. How many different 5-letter “words” (sequences) are there with no repeated 
letters formed from the 26-letter alphabet? 

3. How many ways are there to distribute six different books among 13 children if 
no child gets more than one book? 

Comment on the answers (a) C(13, 6), (b) 1 3 6 , and (c) C(13, 6)6! 

4. How many ways are there to seat six different boys and six different girls along one 
side of a long table with 12 seats? How many ways if boys and girls alternate seats? 
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5. How many arrangements are there of the seven letters in the word UNUSUAL? 

6. How many ways are there to pick a subset of four different letters from the 
26-letter alphabet? 

7. How many ways are there to pick a 5-person basketball team from 14 possible 
players? How many teams if the weakest player and the strongest player must 
be on the team? 

8. There are nine white balls and four red balls in an urn. How many different ways 
are there to select a subset of six balls, assuming the 13 balls are different? What 
is the fraction of selections with four whites and two reds? 

9. If a fair coin is flipped 1 1 times, what is the probability of nine or more heads? 

10. What is the probability that an arrangement of a, b, c, e, f g begins and ends 
with a vowel? 

11. Given six distinct pairs of gloves, 12 distinct gloves in all, how many ways are 
there to distribute two gloves to each of six sisters 

(a) If the two gloves someone receives might both be for the left hand or right 
hand? 

(b) If each sister gets one left-hand glove and one right-hand glove? 

12. How many ways are there to partition 12 people into: 

(a) Three groups of sizes two, four, and six? 

(b) Two (unordered) groups of size six? 

13. How many 7-letter sequences (formed from the 26 letters in the alphabet, with 
repetition allowed) contain exactly one A and exactly two Bs? 

14. (a) On a 10-question test, how many ways are there to answer exactly eight 

questions correctly? 

(b) Repeat part (a) with the requirement that the first or second question, but 
not both, are answered correctly. 

(c) Repeat part (a) in the case that three of the first five questions are answered 
correctly. 

15. How many n-digit ternary (0, 1 , 2) sequences with exactly nine Os are there? 

Comment on the answers (a) 3" -9 , (b) 2” -9 , and (c) C(n, 8)3" -9 . 

16. What is the probability that a five-card poker hand has the following? 


(a) Four Aces 

(d) A full house (three of a kind and 

(b) Four of a kind 

a pair) 

(c) Two pairs (not four of a kind or 

(e) A straight (a set of five consecutive 

a full house) 

values) 


(f) No pairs (possibly a straight or 


flush) 


17. What is the probability that a randomly generated n -digit ternary sequence has 
exactly k Os? 
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18 . What is the probability that a randomly chosen 10-card hand has exactly three 
three-of-a-kinds (and no four-of-a-kinds)? 

19 . How many 10-letter sequences (formed from the 26 letters in the alphabet, with 
repetition allowed) contain exactly m Rs and exactly n Ts? 

20 . Of a company’s personnel, seven work in design, 14 in manufacturing, four in 
testing, five in sales, two in accounting, and three in marketing. A committee of 
six people is to be formed to meet with upper management. 

(a) In how many ways can the committee be formed if there must be exactly 
two members from the manufacturing department? 

(b) Lucy works in the design department and her husband Ricky works in 
marketing. In how many ways can the committee be formed if they cannot 
both be on the committee together? 

(c) In how many ways can the committee be formed if there must be at least 
two members from the manufacturing department? 

21 . How many ways can a committee be formed from four men and six women with 

(a) At least two men and at least twice as many women as men? 

(b) Between three and five people, and Ms. Wonder is excluded? 

(c) Five people, and not all of the three O’Hara sisters can be on the committee? 

(d) Four members, at least two of whom are women, and Mr. and Mrs. Baggins 
cannot both be chosen? 

22 . Suppose a class of 50 students has 20 males and 30 females. The instructor will 
pick one (different) student to compete in three different national competitions — 
in mathematics, chemistry, and English. 

(a) What is the probability that there is exactly one female student selected? 

(b) What is the probability that there are at least two male students selected? 

23 . Suppose that campus telephone numbers consist of any four digits (repetition 
allowed). 

(a) What is the probability that the digit 6 appears at least twice in a campus 
telephone number? 

(b) What is the probability that a campus telephone number contains exactly 
two different digits (e.g., 2444)? 

(c) What is the probability that a campus telephone number consists of four 
distinct digits in ascending order (e.g., 2578)? 

24 . There are eight applicants for the job of dog catcher and three different judges 
who each rank the applicants. Applicants are chosen if and only if they appear 
in the top three in all three rankings. 

(a) How many ways can the three judges produce their three rankings? 

(b) What is the probability of Mr. Dickens, one of the applicants, being chosen 
in a random set of three rankings? 
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25. There are six different French books, eight different Russian books, and five 
different Spanish books. How many ways are there to arrange the books in a 
row on a shelf with all books of the same language grouped together? 

26. If 1 3 players are each dealt four cards from a 52-card deck, what is the probability 
that each player gets one card of each suit? 

27. How many 10-letter (sequences) are there using five different vowels and five 
different consonants (chosen from the 21 possible consonants)? What is the 
probability that one of these words has no consecutive pair of consonants? 

Comment on the answers (a) 3(n — 1 ) !, (b) (3 !(n — 3)!) 3 , and (c) 3((n— l)!) 3 . 

28. What is the probability that a random 9-digit Social Security number has at least 
one repeated digit? 

29. What is the probability that an arrangement of a, b, c, d, e,f has 

(a) a and b side-by-side? 

(b) a occurring somewhere before hi 

30. How many ways are there to pair off 10 women at a dance with 10 out of 20 
available men? 

31. How many triangles are formed by (assuming no three lines cross at a point) 

(a) Pieces of n nonparallel lines; for example, the four lines below form four 
triangles: acd, abf, efd, and ebcl 

(b) Pieces of n lines, m of which are parallel and the others mutually nonparallel? 



32. There are three women and five men who will split up into two four-person 
teams. How many ways are there to do this so that there is (at least) one woman 
on each team? 

33. A man has n friends and invites a different subset of four of them to his house 
every night for a year (365 nights). How large must n be? 

34. How many arrangements of JUPITER are there with the vowels occurring in 
alphabetic order? 

Comment on the answers (a) 3!4!, (b) 7 !/3 !, and (c) C(7, 3)4! 

35. Determine the Shapley-Shubik index of a 1-vote person and a 2-vote person in 
the councils with the following make up: 

(a) Three 1-vote people and one 2-vote person. 

(b) Two 1-vote people and two 2-vote people (majority is four). 
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(c) Four 1-vote people and three 2-vote people (majority is six). 

(d) Four 1-vote people and four 2-vote people (majority is seven). 

36. How many 7-card hands dealt are there with three pairs (each of a different kind, 
plus a seventh card of a different kind)? 

37. How many arrangements of the 26 letters of the alphabet in which 

(a) a occurs before bl 

(b) a occurs before b and c occurs before dl 

(c) the five vowels appear in alphabetical order? 

38. A student must answer five out of 10 questions on a test, including at least two 
of the first five questions. How many subsets of five questions can be answered? 

39. How many 6-letter sequences are there with at least three vowels (A, E, I, 

0, U)? No repetitions are allowed. 

40. How many arrangements of 1, 1, 1, 1, 2, 3, 3 are there with the 2 not beside 
either 3? 

41. How many arrangements of INSTRUCTOR are there in which there are exactly 
two consonants between successive pairs of vowels? 

42. How many “words” can be formed by rearranging INQUISITIVE so that U does 
not immediately follow Q? 

43. Suppose a subset of 60 different days of the year are selected at random in a 
lottery. What is the probability that there are five days from each month in the 
subset? (For simplicity, assume a year has 12 months with 30 days each.) 

44. Suppose a subset of eight different days of the year is selected at random. 
What is the probability that each day is from a different month? (For simplicity, 
assume a year has 12 months with 30 days each). 

45. What is the probability of randomly choosing a permutation of the 10 digits 0, 

1, 2, . . . , 9 in which 

(a) An odd digit is in the first position and 1, 2, 3, 4, or 5 is in the last position? 

(b) 5 is not in the first position and 9 is not in the last position? 

46. (a) What is the probability that a 5-card hand has at least one card of each 

suit? 

(b) Repeat for a 6-card hand. 

47. What is the probability that a 5-card hand has 

(a) At least one of each of the four values: Ace, King, Queen, and lack? 

(b) The same number of hearts and spades? 

48. How many n-digit decimal (0, 1, 2) sequences are there with A: Is? 

49. What fraction of all arrangements of INSTRUCTOR have: 

(a) Three consecutive vowels? 

(b) Two consecutive vowels? 
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50. If one quarter of all 3-subsets of the integers 1,2 , ,m contain the integer 5, 

determine m. 

51. How many ways are there to form an (unordered) collection of four pairs of two 
people chosen from a group of 30 people? 

52. What fraction of all arrangements of GRACEFUL have: 

(a) F and G appearing side by side? 

(b) No pair of consecutive vowels? 

53. How many arrangements of SYSTEMATIC are there in which each S is followed 
by a vowel (this includes Y)? 

54. How many arrangements of MATHEMATICS are there in which each consonant 
is adjacent to a vowel? 

55. (a) What is the probability that k is the smallest integer in a subset of four 

different numbers chosen from 1 through 20 (1 < k < 17)? 

(b) What is the probability that k is the second smallest? 

56. What is the probability that the difference between the largest and the smallest 
numbers is k in a subset of four different numbers chosen from 1 through 20 
(3 <k< 19)? 

57. How many ways are there to place eight identical black pieces and eight identical 
white pieces on an 8 x 8 chessboard? 

58. How many 8-letter arrangements can be formed from the 26 letters of the 
alphabet (without repetition) that include at most three of the five vowels and in 
which the vowels appear in alphabetical order? (Hint: Break into cases.) 

59. What is the probability that two (or more) people in a random group of 25 people 
have a common birthday? (This is the famous Birthday Paradox Problem.) 

60. How many ways are there to arrange n (distinct) people so that (i) Mr. and Mrs. 
Smith are side by side; and (ii) Mrs. Tucker is k positions away from the Smiths 
(i.e., k — 1 people between Mrs. Tucker and the Smiths)? 

61. A basketball team has five players, three in ‘ ‘forward” positions (this includes the 
‘ ‘center”) and two in ‘ ‘guard” positions. How many ways are there to pick a team if 
there are six forwards, four guards, and two people who can play forward or guard? 

62. A family has two boys and three girls to send to private schools. There are five 
boys’ schools, eight girls’ schools, and three coed schools. If each child goes 
to a different school, how many different subsets of five schools can the family 
choose for their children? 

63. Given a collection of 2 n objects, n identical and the other n all distinct, how 
many different subcollections of n objects are there? 

64. A batch of 50 different automatic typewriters contains exactly 10 defective 
machines. What is the probability of finding 

(a) At least one defective machine in a random group of five machines? 

(b) At least two defective machines in a random group of 10 machines? 
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(c) The first defective machine to be the Mi machine taken apart for inspection 
in a random sequence of machines? 

(d) The last defective machine to be the Mi machine taken apart? 

65. Ten fish are caught in a lake, marked, and then returned to the lake. Two days 
later 20 fish are again caught, two of which have been marked. 

(a) Find the probability of two of the 20 fish being marked if the lake has k fish 
(assuming the fish are caught at random). 

(b) What value of k maximizes the probability? 


66. Professor Grinch’s telephone number is 6328363. Mickey remembers the 
collection of digits but not their order, except that he knows the first 6 is before 
the first 3. How many arrangements of these digits with this constraint are there? 

67. How many arrangements of the letters in PREPOSTEROUS are there in which 
the first vowel to appear is an E? 

68. For the given map of roads between city A and city B, 


(a) How many routes are there from A to B that do not repeat any road (a road 
is a line segment between two intersections)? 

(b) How many ways are there for two people to go from A to B without both 
ever traversing the same road in the same direction? 

69 . What is the probability that a random five-card hand has 

(a) Exactly one pair (no three of a kind or two pairs)? 

Comment on the answers (a) 13 x C(4, 2) x C(48, 3)/C(52, 5), 

(b) 13 x C(4, 2) x 48 x 44 x 40/C(52, 5), and (c) (52 x 3 x 48 x 44 x 40)/ 
(52x 51 x 50x49 x48). 

(b) One pair or more (three of a kind, two pairs, four of a kind, full house)? 

(c) The cards dealt in order of decreasing value? 

(d) At least one spade, at least one heart, no diamonds or clubs, and values of 
all spades greater than the values of all hearts? 

70 . How many ways are there to pick a group of n people from 100 people (each of 
a different height) and then pick a second group of m other people such that all 
people in the first group are taller than the people in the second group? 

71 . How many subsets of three different integers between 1 and 90 inclusive are 
there whose sum is 

(a) An even number? 

(b) Divisible by 3? 

(c) Divisible by 4? 
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72. How many arrangements are there of the letters in REPETITIVENESS such that 
the Ss are consecutive AND the first I comes (somewhere) before the first E ? 

73. How many arrangements are there of the letters in STATISTICIANS such that 
the Is are consecutive and the first S comes (somewhere) before the first T? 

74. How many arrangements of the letters in MATHEMATICS are there such that 
the last vowel in the arrangement is an I. 

75. In a class of 20 students, one student is chosen president, another one is chosen 
secretary, and another is chosen treasurer. Such elections are held each week for 
six weeks. A student can be elected to one of the positions more than once. How 
many sequences of six election outcomes are possible? 

76. Sweaters are made in a design with three bands of colors: a top color, a middle 
color, and a bottom color. Eight colors are available and a color can appear in 
only one level. How many subsets of 12 different sweaters are there? 

77. Each Saturday for 12 weeks, a different pair is chosen from eight senior citizens 
to play a chess match. How many sequences of 12 different pairs are possible? 

78. How many arrangements of PEPPERMILL are there in which MP appear con- 
secutively or LP appear consecutively but not both MP and LP are consecutive? 

79. How many arrangements of UNUSUALLY are there in which SU appear con- 
secutively or LU appear consecutively but not both SU and LU are consecutive? 

80. How many ways are there for a woman to invite different subsets of three of her 
five friends on three successive days? How many ways if she has n friends? 

81. How many triangles can be formed by joining different sets of three corners of 
an octagon? How many triangles if no pair of adjacent corners is permitted? 

82. How many arrangements of five Os and ten Is are there with no pair of 
consecutive Os? 

83. (a) How many points of intersection are formed by the chords of an n-gon 

(assuming no three of these lines cross at one point)? 

(b) Into how many line segments are the lines in part (a) cut by the intersection 
points? 

(c) Use Euler’s formula r=e— v+2 and parts (a) and (b) to determine the 
number of regions formed by the chords of an n-gon. 

84. Given 14 positive integers, 12 of which are even, and 16 negative integers, 11 
of which are even, how many ways are there to pick 12 numbers from this 
collection of 30 integers such that six of the 12 numbers are positive and six of 
the 12 numbers are even? 

85. How many triangles are formed by (assuming no three lines cross at a point): 

(a) Pieces of three chords of a convex 10-gon such that the triangles are wholly 
within the 10-gon (a corner of the 10-gon cannot be a corner of any of these 
triangles)? 

(b) Pieces of three chords or outside edges of a convex n-gon? 
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86 . How many arrangements of the integers 1, 2, . . . , n are there such that each 
integer differs by 1 (except the first integer) from some integer to the left of it 
in the arrangement? 


87. A man has seven friends. How many ways are there to invite a different subset 
of three of these friends for a dinner on seven successive nights such that each 
pair of friends are together at just one dinner? 



5.3 ARRANGEMENTS AND SELECTIONS 
WITH REPETITIONS 

In this section we discuss arrangements and selections with repetition — arrangements 
of a collection with repeated objects, such as the collection b, a, n, a, n, a, and selections 
when an object can be chosen more than once, such as ordering six hot dogs chosen 
from three varieties. We motivate the formulas for these counting problems with the 
two examples just mentioned. 


Example 1: Arrangements of banana 

How many arrangements are there of the six letters b, a, n, a, n, a? 

Consider a possible arrangement: 

n a_ b n a a 

This problem is solved by an extension of reasoning used to solve the problem of 
counting 8-digit binary sequences with six Is (Example 3 in Section 5.2). The key 
is to focus on the subset of positions where the as go and the subset of positions 
where the ns go. For example, the above arrangement is characterized by having as in 
positions 2, 5, 6, ns in positions 1, 4, and b in position 3. We count the arrangements 
by first choosing the subset of three positions in the arrangement where the as will 
go — C(6, 3) = 20 ways — then the subset of two positions (out of the remaining three) 
where the ns will go — C(3, 2) = three ways — and finally the last remaining position 
gets the b — 1 way. Thus there are C(6, 3) x C(3, 2) x C(l, 1) = 20 x 3 x 1 = 60 
arrangements. ■ 


Theorem 1 

If there are n objects, with r\ of type I , r 2 of type 2, . . . , and r m of type m, where 
r i + r 2 + ■ — h r m = n, then the number of arrangements of these n objects, denoted 
P(n\ n, r 2 , r m ), is 


P{n\r i, r 2 , . . . , r m ) = 


r\ ) \ r 2 
n\ 

n\r 2 \ . ,.r m \ 


n \ t n — r\ \ / n — r\—r 2 


f3 


n~n~r 2 r m _ j 

r m 


(*) 
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Proof 1 

First pick r\ positions for the first types, then ri of the remaining positions for the 
second types, and so on. A mathematically precise proof of the “etc.” part requires 
induction (see Exercise 35). The second line of (*) is just a simplification that results 
from canceling factorials in the binomial coefficients; for example. 


P( 6; 3, 2, 1) = 




1\ 6! 3! 1! 6! 

, 1 = x x — = ♦ 

1/ 3!3! 2 ! 1 ! 1! 3!2!1! 


Proof 2 

This proof is similar to our derivation of C(n, r) through the equation Pin, r) = 
C(n, r) x P(r, r). Suppose that for each type, the r, objects of type i are given subscripts 
numbered 1, 2, . . . , r, to make each object distinct. Then there are n\ arrangements 
of the n distinct objects. Let us enumerate these n\ arrangements of distinct objects 
by enumerating all P(n; r\ , r 2 , . . . , r m ) patterns (without subscripts) of the objects, 
and then for each pattern placing the subscripts in all possible ways. For example, the 
pattern baanna can have subscripts on as placed in the 3! ways: 

ba^nnai ba2a\nna-$ baT,a\nna2 

baT,a2nna\ baia-i,nna2 ba2CiT,nna\ 

For each of these 3! ways to subscript the as, there are 2! ways to subscript the 
ns. Thus, in general a pattern will have /-|! ways to subscript the r\ objects of type 1, 
rj. ways for type 2, and r m \ ways for type m. Then 

n\ = P(n\n, r 2 , ..., r m )rf.r 2 \ . . . r m ! 
or 

n\ 

P(n-,n,r 2 , ..., r m ) = — — - ♦ 

r\ lr 2 ! . . ■ r m ! 


Example 2: Ordering Hot Dogs 

How many different ways are there to select six hot dogs from three varieties of 
hot dog? 

To solve such a selection-with-repetition problem, we recast it as an arrangement 
problems as follows. Suppose the three varieties are regular dog, chili dog, and super 
dog. Let a selection be written down on an order form (when a person places this 
order) in the following fashion: 


Regular 

Chili 

Super 

X 

xxxx 

X 


Each x represents a hot dog. The request shown on the form above is one regular, 
four chili, and one super. Since the hot dog chef knows that the sequence of dogs on 
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the form is regular, chili, super, the request can simply be written as x\xxxx\x without 
column headings. 

Any selection of r hot dogs will consist of some sequence of r xs and two |s. 
Conversely, any sequence of r xs and two |s represents a unique selection: the xs 
before the first | count the number of regular dogs; the xs between the two |s count 
chilis; and the final xs count supers. So there is a one-to-one correspondence between 
orders and such sequences. Counting the number of sequences of six xs and two |s is 

( 8\ /2\ Q | 

6/ ( 2 ) = 6fU ' 

As discussed in Example 3 of Section 5.2, counting such sequences of xs and |s is 
simply a matter of picking the subset of positions where the xs (or the | s) go — again, 

© wa y s - ■ 

Theorem 2 

The number of selections with repetition of r objects chosen from n types of objects 
is C(r + n — 1, r). 

Proof 

We make an “order form” for a selection just as in Example 2, with an x for each 
object selected. As before, the xs before the first | count the number of the first type 
of object, the xs between the first and second |s count the number of the second type, 
. . . , and the xs after the (n — l)-st | count the number of the nth type (n — 1 slashes 
are needed to separate n types). The number of sequences with r xs and n — 1 |s is 
C(r + (n — 1), r). ♦ 

Example 3: Grouping Classes 

Nine students, three from Ms. A’s class, three from Mr. B’s class, and three from 
Ms. C’s class, have bought a block of nine seats for their school’s homecoming game. 
If three seats are randomly selected for each class from the nine seats in a row, what 
is the probability that the three A students, three B students, and three C students will 
each get a block of three consecutive seats? 

In probability problems, we seek the number of favorable outcomes divided by 
all outcomes. The first question is, what is the set of all outcomes in this problem? 
They are the P( 9; 3, 3, 3) = 9!/3!3!3! = 1680 ways to arrange three As, three Bs and 
three Cs in the row of nine seats. 

If the three students of each class are to sit together, then we want to count 
outcomes that are arrangements of the three blocks, AAA, BBB, and CCC. Thus, 
instead of nine letters, we really are working now with three composite letters. There 
are 3 ! = 6 ways to arrange these three composite letters. So the probability that each 
class sits together is 6/1680. ■ 

Example 4: Sequencing Genes 

The genetic code of organisms is stored in DNA molecules as a long string of four 
nucleotides: A (adenine), C (cytosine), G (guanine), and T (thymine). Short strings of 
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DNA can be “sequenced” — the sequence of letters determined — by various modern 
biotech methods. Although the DNA sequence for a single gene typically has hundreds 
or thousands of letters, there exist special enzymes that will split a long string into short 
fragments (which can be sequenced) by breaking the string immediately following 
each appearance of a particular letter. 

Suppose a C-enzyme (which splits after each appearance of C) breaks a 20-letter 
string into eight fragments, which are identified to be: AC, AC, AAATC, C, C, C, 
TATA, TGGC. Note that each fragment, except the last one on the string, must end 
with a C. How many different strings could have given rise to this set of fragments? 

Since the fragment TATA does not end with a C, it must go at the end of the 
string. The other seven fragments can occur in any order. These fragments consist 
of three Cs, two ACs, one AAATC, and one TGGC. Treat each fragment as a letter, 
similar to the reasoning in Example 3. Then we must arrange seven letters, three of 
one type, two of a second type, and one each of two other types. There are thus P(7 ; 
3, 2, 1, 1) = 420 possible arrangements of the fragments to form a 20-letter string. ■ 


If we use an A-enzyme to break the same string into fragments and look at 
all the possible arrangements of these fragments and then do the same with a 
G-enzyme and a T-enzyme, there will normally be only one string that appears in 
all four sets of arrangements. This will be the true original string. More sophisticated 
variations on this approach are used to determine the DNA sequence of entire genes. 

Example 5: Sequences with Varying Repetitions 

How many ways are there to form a sequence of 10 letters from four as, four bs, 
four cs, and four c/s if each letter must appear at least twice? 

To apply Theorem 1 we need to know exactly how many as, bs, cs, and ds will 
be in the arrangement. Thus we have to break this problem into a set of subproblems 
that each involves sequences with given numbers of as, bs, cs, and ds. There are two 
categories of letter frequencies that sum to 10 with each letter appearing two or more 
times. The first category is four appearances of one letter and two appearances of 
each other letter. The second is three appearances of two letters and two appearances 
of the other two letters. 

In the first category, there are four cases for choosing which letter occurs four 
times and Pi 1 0; 4, 2, 2, 2) = 18,900 ways to arrange four of one letter and two of 
the three others. In the second category, there are C(4, 2) = 6 cases for choosing 
which two of the four letters occur three times and P(10; 3, 3, 2, 2) = 25,200 ways 
to arrange three of two letters and two of the two others. The final answer is 4 x 
18,900 + 6 x 25,200 = 226,800 ways. ■ 

Example 6: Selecting Doughnuts 

How many ways are there to fill a box with a dozen doughnuts chosen from five 
different varieties with the requirement that at least one doughnut of each variety is 
picked? 
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The starting question to solve this problem is: how do we recast selection with at 
least one of each kind in terms of an unconstrained selection with repetition or some 
other counting problem we know how to solve? Handling this constraint involves an 
insight that is hard to find but follows immediately from the way people make such 
a doughnut selection. They would first pick one doughnut of each variety and then 
pick the remaining seven doughnuts any way they pleased. There is no choice (only 
one way) in picking one doughnut of each type. The choice occurs in picking the 
remaining seven doughnuts from the five types. Think of placing one doughnut of 
each type in a box and then covering them with a sheet of waxed paper, and then 
choosing the remaining seven doughnuts. The variety in outcomes involves only the 
seven doughnuts chosen to go on top of the waxed paper. So the answer by Theorem 
2 is C(7 + 5 — 1, 7) = 330. (Note that the set composition principle does not apply 
here, because the objects are not all distinct.) ■ 

Note that in the selection with repetition, we are only concerned with counting 
how many of each type we have. With this objective, it is easy to handle lower bounds 
for the types (as in Example 6), since this constraint recasts the problem into counting 
how many more than the lower bound we have of each type. However, in arrangement 
with repetition, lower bound constraints are much more complex. For each outcome 
in the selection with repetition problem — that is, when we consider specific amounts 
of each type, we have a subproblem of counting how many ways there are to arrange 
the items in this selection. 

Example 7: Selections with Lower and Upper Bounds 

How many ways are there to pick a collection of exactly 10 balls from a pile of red 
balls, blue balls, and purple balls if there must be at least five red balls? If at most 
five red balls? 

The first problem is similar to Example 6. We put five red balls in a box, cover them 
with waxed paper, and then select five more balls arbitrarily (possibly including more 
red balls). The five balls above the waxed paper can be chosen in C(5 + 3 — 1 , 5) = 21 
ways. 

There is no direct way to count selections when there is an upper bound of the 
number of repetitions of some object. To handle the constraint of at most five red 
balls, we count the complementary set. Of all C(10 + 3 — 1, 10) = 66 ways to pick 
1 0 balls from the three colors without restriction, there are C(4 + 3 — 1,4)= 15 ways 
to choose a collection with at least six red balls (put six red balls in a box, cover with 
waxed paper, and then arbitrarily choose four more balls). So there are 66 — 15 = 51 
ways to choose 10 balls without more than five red balls. ■ 


Example 8: Arrangements with Restricted Positions 

We return to Example 1 about arrangements of the letters in banana, but now with 
some constraints of the sort encountered in Section 5.2. How many arrangements are 
there of the letters b, a, n, a, n, a such that: 
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(a) The h is followed (immediately) by an a: We use the method for counting ar- 
rangements with consecutive letters introduced in Example 2 of Section 5.2; that 
is, we glue the b and one of the as together to form the multiletter ba. Now we 
want to count all arrangements of the five “letters”: ba, a, a, n, n. By Theorem 1, 
there are 5!/l!2!2! = 30 arrangements. 

(b) The pattern bnn never occurs: We solve this problem by counting all arrangements 
of b, a, n, a, n, a without constraint and then subtracting off the arrangements 
with the pattern bnn. Arrangements with this pattern are handled the same way 
as the pattern ba in part (a). We treat bnn as a single multiletter and now count 
arrangements of the four letters bnn , a , a, a — 4 !/ 1 !3! = 4 ways. Subtracting the 
four forbidden arrangements from all 6!/l!3!2! = 60 arrangements yields the 
answer, 60 — 4 = 56. 

(c) The b occurs before any of the as (not necessarily immediately before an a): in 
other words, the relative order of the b and 3 as is b-a-a-a. We use the method 
of handling relative order introduced in the continuation of Example 2 in Section 
5.2. We pick a subset of four positions (for these four letters) from the six positions 
in an arrangement — C(6, 4) = 15 ways. We put the b, a, a, a in these four positions 
in the required relative order — one way. Next we fill the two remaining positions 
in the arrangement with the two ns — one way. So the final answer is 15 x 1 x 


1 = 15. ■ 


5.3 EXERCISES 



' Summary of Exercises Most of these problems are not-too- 
difficult variations on the section’s examples. Be careful to distinguish between 
arrangements and subsets problems. 

1. How many ways are there to roll a die seven times and obtain a sequence of 
outcomes with three Is, two 5s, and two 6s? 

2. How many ways are there to arrange the letters in STATISTICAL? 

3. (a) How many 8-digit numbers can be formed with the digits 3, 5, and 7? 

(b) What fraction of the numbers in part (a) have three 3s, two 5s, and three 7s? 

4. How many ways are there to invite one of four different friends over for dinner 
on five successive nights such that no friend is invited more than three times? 

5. How many ways are there to pick a collection of nine coins from piles of pennies, 
nickels, dimes, and quarters? 

6. If four identical dice are rolled, how many different outcomes can be recorded? 

7. How many ways are there to select a committee of 17 politicians chosen from 
a room full of indistinguishable Democrats, indistinguishable Republicans, and 
indistinguishable Independents if every party must have at least two members on 
the committee? If, in addition, no group may have a majority of the committee 
members? 
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8. Ten different people walk into a delicatessen to buy a sandwich. Four always 
order tuna fish, two always order chicken, two always order roast beef, and two 
order any of the three types of sandwich. 

(a) How many different sequences of sandwiches are possible? 

(b) How many different (unordered) collections of sandwiches are possible? 

9. How many ways are there to pick a selection of coins from $1 worth of identical 
pennies, $1 worth of identical nickels, and $1 worth of identical dimes if 

(a) You select a total of 9 coins? 

(b) You select a total of 16 coins? 

Comment on the answers: 



10. (a) How many ways are there to distribute seven identical apples and six identical 

pears to three distinct people such that each person has at least one pear? 

(b) How many ways are there to distribute seven distinct applies and six distinct 
pears to three distinct people such that each person has at least one pear? 

11. How many ways are there to have a collection of eight fruits from a large pile 
of identical oranges, apples, bananas, peaches, and pears if the collection should 
include exactly two different kinds of fruits? 

12. How many ways are there to pick nine balls from large piles of (identical) red, 
white, and blue balls plus one pink ball, one lavender ball, and one tan ball? 

13. How many numbers greater than 3,000,000 can be formed by arrangements of 
1,2, 2, 4, 6, 6, 6? 

14. How many different rth-order partial derivatives does f{x \ , X 2 , ■ ■ ■ , x n ) have? 

15. How many 8-digit sequences are there involving exactly six different digits? 

16. How many 9-digit numbers are there with twice as many different odd digits 
involved as different even digits (e.g., 945222123 with 9, 3, 5, 1 odd and 2, 4 
even). 

17. How many arrangements are possible with five letters chosen from 
MISSISSIPPI? 

18. How many ways are there to select an unordered group of eight numbers 
between 1 and 25 inclusive with repetition? In what fraction of these ways is 
the sum of these numbers even? 

19. How many arrangements of letters in REPETITION are there with the first E 
occurring before the first T? 
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20. In an international track competition, there are five United States athletes, four 
Russian athletes, three French athletes, and one German athlete. How many 
rankings of the 13 athletes are there when 

(a) Only nationality is counted? 

(b) Only nationality is counted and all the U.S. athletes finish ahead of all the 
Russian athletes? 

21. How many arrangements of the letters in MATHEMATICS are there in which 
TH appear together but the TH is not immediately followed by an E (not THE)? 

22. How many arrangements of the letters in PEPPERMILL are there with 

(a) The M appearing to the left of all the vowels? 

(b) The first P appearing before the first L? 

23. How many arrangements of the letters in MISSISSIPPI in which 

(a) The M is followed (immediately) by an I? 

(b) The M is beside an I — that is, an I is just before or just after the M. Possibly 
there is an I both just before and just after the M — special care is required to 
make sure you count arrangements with the pattern IMI correctly. 

24. How many arrangements of PREPOSTEROUS are there in which the five vowels 
are consecutive? 

25. How many ways to select a subset of eight doughnuts from three types of dough- 
nuts if at most two doughnuts of the first type can be chosen? 

26. How many sequences of outcomes are possible if one rolls two identical dice 10 
successive times? 

27. How many ways are there to split a group of 2 n as, 2 n /is, and 2 n y s in half 
(into two groups of 3 n letters)? (Note: The halves are unordered; there is no first 
half.) 

28. How many ways are there to place nine different rings on the four fingers of your 
right hand (excluding the thumb) if 

(a) The order of rings on a finger does not matter? 

(b) The order of rings on a finger is considered? (Hint: Tricky.) 

29. Show that 'EP(10,k\, k 2 , k 3) = 3 10 , where k\, ^2, £3 are nonnegative integers 
ranging over all possible triples such that k\ + £2 + £3 = 10. 

30. How many arrangements of six Os, five Is, and four 2s are there in which 

(a) The first 0 precedes the first 1? 

(b) The first 0 precedes the first 1, which precedes the first 2? 

31. How many arrangements are there of 4 n letters, four of each of n types of letters, 
in which each letter is beside a similar letter? 

32. How many ways are there for 10 people to have five simultaneous telephone 
conversations? 
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33. When a coin is flipped n times, what is the probability that 

(a) The first head comes after exactly m tails? 

(b) The ith head comes after exactly m tails? 

34. How many arrangements are there of TINKERER with two but not three 
consecutive vowels? 

35. How many arrangements are there of seven as, eight bs, three cs, and six c/s with 
no occurrence of the consecutive pairs ca or cc? 

36. How many arrangements of five as, five /3s, and five ys are there with at least 
one p and at least one y between each successive pair of as? 

37. (a) Use induction to give a rigorous proof of Theorem 1 (Proof 1). 

(b) Use induction to prove that 



5.4 DISTRIBUTIONS 

Generally a distribution problem is equivalent to an arrangement or selection problem 
with repetition. Specialized distribution problems must be broken up into subcases 
that can be counted in terms of simple permutations and combinations (with and 
without repetition). General guidelines for modeling distribution problems are 

Distributions of distinct objects are equivalent to arrangements 

and 

Distributions of identical objects are equivalen t to selections 


Basic Models for Distributions 

Distinct Objects The process of distributing r distinct objects into n different 
boxes is equivalent to putting the distinct objects in a row and stamping one of 
the n different box names on each object. The resulting sequence of box names 
is an arrangement of length r formed from n items (box names) with repeti- 
tion. Thus there are n x n x • ■ ■ x n (r ns) = n r distributions of the r distinct 
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objects. If r,- objects must go in box i, 1 < i < n, then there are P(r; r 1; r 2 , . . . , r„) 
distributions. 

Identical Objects The process of distributing r identical objects into n differ- 
ent boxes is equivalent to choosing an (unordered) subset of r box names with 
repetition from among the n choices of boxes. Thus there are C(r + n — l,r) = 
(r + n — 1 )\/rl(n — 1)! distributions of the r identical objects. 


Example 1: Assigning Diplomats 

How many ways are there to assign 100 different diplomats to five different continents? 
How many ways if 20 diplomats must be assigned to each continent? 

According to the model for distributions of distinct objects, this assignment equals 
the number of sequences of length 100 involving the five continental destinations — 
5 100 such sequences (think of the diplomats lined up in a row holding attache cases 
stamped with their destination). The constraint that 20 diplomats go to each continent 
means that each continent name should appear 20 times in the sequence. This can be 
done P(100; 20, 20, 20, 20, 20) = 100!/(20!) 5 ways. ■ 


Example 2: Bridge Hands 

In bridge, the 52 cards of a standard card deck are randomly dealt 13 apiece to players 
North, East, South, and West. What is the probability that West has all 13 spades? 
That each player has one Ace? 

There are P( 52; 13, 13, 13, 13) distributions of the 52 cards into four different 
13-card hands, using the same reasoning as in the preceding example. Distributions 
in which West gets all the spades may be counted as the ways to distribute to West 
all the spades — one way — times the ways to distribute the 39 non-spade cards among 
the three other hands — P(39; 13, 13, 13) ways. So the probability that West has all 
the spades is 



The simple form of this answer can be directly obtained by considering West’s 
possible hands alone (ignoring the other three hands). A random deal gives West one 
of the C(52, 13) possible 13-card hands. Thus, the unique hand of 13 spades has 
probability 1/C(52, 13). 

To count the ways in which each player gets one Ace, we divide the distribution 
up into an Ace part — four! ways to arrange the four Aces among the four players — and 
a non-Ace part — P(48; 12, 12, 12, 12) ways to distribute the remaining 48 non-Ace 
cards, 12 to each player. So the probability that each player gets an Ace is 


4148! /J2! = 13!^ x 4148! = //52X 

( 1 2 ! ) 4 / ( 1 3 ! ) 4 12 ! 4 52! /V 4 / 


0.105 ■ 
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Example 3: Distributing Candy 

How many ways are there to distribute 20 (identical) sticks of red licorice and 15 
(identical) sticks of black licorice among five children? 

Using the identical objects model for distributions, we see that the ways to dis- 
tribute 20 identical sticks of red licorice among five children is equal to the ways 
to select a collection of 20 names (destinations) from a set of five different names 
with repetition. This can be done C(20 + 5— 1,20)= 10,626 ways. By the same 
modeling argument, the 15 identical sticks of black licorice can be distributed in 
C(15 + 5— 1,15) = 3876 ways. The distributions of red and of black licorice are 
disjoint procedures. The multiplication principle applies and so the number of 
ways to distribute red and black licorice is 10,626 x 3876 = 41,186,376. ■ 


Example 4: Distributing a Combination of Identical and Distinct Objects 

How many ways are there to distribute four identical oranges and six distinct apples 
(each a different variety) into five distinct boxes? In what fraction of these distributions 
does each box get exactly two objects? 

There are C(4 + 5 — 1 , 4) = 70 ways to distribute four identical oranges into five 
distinct boxes and 5 6 = 15,625 ways to put six distinct apples in five distinct boxes. 
These two processes are disjoint, and so there are 70 x 15,625 = 1,093,750 ways to 
distribute the four identical and six distinct fruits. 

The additional constraint of two objects in each box substantially complicates 
matters. To count constrained distributions of distinct objects, we must know exactly 
how many of the distinct objects should go in each box. But in this problem, the 
number of distinct objects that can go in a box depends on how many identical 
objects are in the box. So we deal with the (identical) oranges first. There are three 
possible categories of distributions of the four oranges (without exceeding two in 
any box). 

Case 1 Two (identical) oranges in each of two boxes and no oranges in the other 
three boxes. The two boxes to get the pair of oranges can be chosen in C(5, 2) = 10 
ways, and the six (distinct) apples can then be distributed in those three other boxes, 
two to a box, in P(6\ 2, 2, 2) = 90 ways. So Case 1 has 10 x 90 = 900 possible 
distributions. 

Case 2 Two (identical) oranges in one box, one orange in each of two other 
boxes, and the remaining two boxes empty. The one box with two oranges can be 
chosen C(5, 1) = 5 ways, and the two boxes with one orange can be chosen in 
C(4, 2) = 6 ways [or combining these two steps, we can think of arranging the num- 
bers 2, 1, 1, 0, 0, among the five boxes in P( 5; 1, 2, 2) = 30 ways]. Then the two 
boxes still empty will each get two apples and the two boxes with one orange will get 
one apple. Thus, six distinct apples can then be distributed in P( 6; 2, 2, 1, 1) = 180 
ways, for a total of 5 x 6 x 180 = 5400 ways. 
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Case 3 One orange each in four of the five boxes. This can be done in C( 5, 4) = 
five ways, and then the apples can be distributed in P( 6 ; 2, 1, 1, 1, 1) = 360 ways, for 
a total of 5 x 360 = 1800 ways. 

Summing these cases, there are 900 + 5400+ 1800 = 8100 distributions with 
two objects in each box. The fraction of such distributions among all ways to distribute 
the four oranges and six apples is 8100/1,093,750 = .0074 — a lot smaller than one 
might guess. (If all 10 fruits were distinct, the fraction with two in each box would 
be about 50 percent larger. Why?) ■ 

Example 5: Distributing Balls 

Show that the number of ways to distribute r identical balls into n distinct boxes with 
at least one ball in each box is C(r — 1, n — 1). With at least r\ balls in the first box, 
at least r 2 balls in the second box, . . . , and at least r n balls in the nth box, the number 
is C(r — r\ — r 2 — ■ ■ ■ — r„ + n — 1, n — 1). 

The requirement of at least one ball in each box can be incorporated into an 
equivalent selection-with-repetition model (as in the doughnut selection problem in 
Section 5.3). One could form such a collection of box destinations by putting one 
label for each of the n boxes on a tray, covering these labels with a piece of waxed 
paper, and then picking the remaining r — n labels in all possible ways. 

Alternatively, we can handle this constraint directly in terms of the boxes as 
follows. First put one ball in each box and then put a false bottom in the boxes to 
conceal the ball in each box. Now it remains to count the ways to distribute without 
restriction the remaining r — n balls into the n boxes. 

With either approach, the answer is 

[(r — n) + n — 11 ! 

C((r -n) + n- l,(r-n))= — — = C(r-l,n- l)ways 

(r — n)\(n — 1 )! 

In the case where at least r, balls must be in the z'th box, we first put r, balls in 
the zth box, and then distribute the remaining r — r\ — r 2 — ■ ■ ■ — r n balls in any way 
into the n boxes. This can be done in 

C((r -r x — r 2 r n ) + n - 1, (r - r x - r 2 r n )) 

= C((r — r\ — r 2 — ■ r n ) + n — 1 , n — 1 ) ways ■ 


The next counting problem will play a critical role in building generating func- 
tions in the next chapter. 

Example 6: Integer Solutions 

How many integer solutions are there to the equation xi +X 2 + X 3 +X 4 = 12, with 
Xi > 0? How many solutions with Xi > 1 ? How many solutions with xi > 2, x 2 > 
2, X 3 > 4, X 4 > 0? 
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By an integer solution to this equation, we mean an ordered set of integer values 
for the x,s summing to 12, such as xi = 2, X 2 = 3, X 3 = 3, X 4 = 4. We can model 
this problem as a distribution-of-identical-objects problem or as a selection- with- 
repetition problem. Let x, represent the number of (identical) objects in box i or the 
number of objects of type i chosen. The integer on the right side of the equation is 
the number of objects to be distributed or selected. Using either of these models, we 
see that the number of integer solutions is C(12 + 4— 1, 12) = 455. 

Solutions with Xi > 1 correspond in these models to putting at least one object in 
each box or choosing at least one object of each type. Solutions with xi > 2, X 2 > 2, 
X 3 > 4, X 4 > 0 correspond to putting at least two objects in the first box, at least 
two in the second, at least four in the third, and any number in the fourth (or 
equivalently in the selection-with-repetition model). Formulas for these two types 
of distribution problems were given in Example 5. The respective answers are 
C(12 - 1, 4 - 1) = 165 and C((12 - 2- 2- 4) + 4-1, 4-1) = C(7, 3) = 
35. ■ 

Equations with integer-valued variables are called diophantine equations. They 
are named after the Greek mathematician Diophantus, who studied them 2,250 
years ago. 

We have now encountered three equivalent forms for selection with repetition 
problems. 


Equivalent Forms for Selection with Repetition 

1. The number of ways to select r objects with repetition from n different 
types of objects. 

2. The number of ways to distribute r identical objects into n distinct boxes. 

3. The number of nonnegative integer solutions to xi + X 2 H — ■ + x n = r. 


It is important that the reader be able to restate a problem given in one of the 
foregoing settings in the other two. Many students find version 2 the most convenient 
way to look at such problems because a distribution is easiest to picture on paper 
(or in one’s head). Indeed, the original argument with order forms for hot dogs used 
to derive our formula for selection with repetition was really a distribution model. 
Version 3 is the most general (abstract) form of the problem. It is the form needed in 
Chapter 6 to build generating functions. 

We next present three problems that we solve by recasting as problems involving 
distributions of identical or distinct objects. 


Example 7: Ingredients for a Witch’s Brew 

A warlock goes to a store with $5 to buy ingredients for his wife’s Witch’s Brew. 
The store sells bat tails for 254 apiece, lizard claws for 254 apiece, newt eyes for 


5.4 Distributions 219 


25 4 apiece, and calf blood for $ 1 a pint bottle. How many different purchases (subsets) 
of ingredients will $5 buy? 

The first step is to make our unit of money 25</ (a quarter). So the warlock has 
20 units to spend, with blood costing 4 units and the other three items each costing 
1 unit. An integer-solutions-of-equation model of this problem is 

T + S+E + 4B = 20, T,S,E,B> 0 


The simplest way to handle the fact that B has a coefficient of 4 is to break into 
cases by specifying exactly how much blood is bought. If 1 pint is bought, B = 1, 
then we have 7’ + S + E = 16, an equation with C(16 + 3 — 1, 16)= 153 nonnegative 
integer solutions. In general, if B = i, then we have T + S + E = 20 — 4/, an equation 
with C((20 - 4i) + 3 - 1, 20 - 4i) = C(22 - 4 i, 2) solutions, for / = 0, 1,2, 3, 4, 5. 
Summing these possibilities, we obtain 





536 different purchases ■ 


Example 8: Binary Patterns 

What fraction of binary sequences of length 10 consists of a (positive) number of Is, 
followed by a number of 0s, followed by a number of Is, followed by a number of 
0s? An example of such a sequence is 111011 1000. 

There are 2 10 = 1024 10-bit binary sequences. We model the problem of counting 
this special type of 10-bit binary sequences as a distribution problem as follows. We 
create four distinct boxes, the first box for the initial set of Is, the second box for 
the following set of 0s, and so on. We have 10 identical markers (call them xs) to 
distribute into the four boxes. Each box must have at least one marker, since each 
subsequence of 0s or of Is must be nonempty. By Example 5 the number of ways to 
distribute lOxs into four boxes with no box empty is C(10 — 1, 4 — 1) = 84. So there 
are 84 such binary sequences and thus 84/1024 ^ 8% of all 10-bit binary sequences 
are of this special type. ■ 


There are three constraints in arrangement problems that are handled by “tricks.” 
One is requiring certain elements to be consecutive; this is handled by gluing the 
elements together. The second is requiring one element to occur before another, or 
more generally, specifying the relative order of a subset of elements; this is handled 
by picking the subset of positions in the arrangement where the ordered elements will 
appear. The next problem presents the third, and most complex, “trick” constraint — 
namely, requiring certain elements to be nonconsecutive (i.e., never side by side). 

Example 9: Nonconsecutive Vowels 

How many arrangements of the letters a, e, i, o, u, x, x, x, x, x, x, x, x (eight xs) are 
there if no two vowels can be consecutive? 
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The right question to start with is: in which positions in the arrangement will 
the xs appear to ensure the us ( v = vowel) are nonconsecutive. Observe that once 
we know where the us goes, it is easy to count the ways to order the five vowels in 
those positions. In a fashion similar to the preceding binary patterns example, the 
placement of xs can be modeled by creating boxes before, between, and after the us 
and distributing the xs into the six resulting boxes with at least one x in the middle four 
boxes (to ensure that no two vowels are consecutive). The following pattern illustrates 
one possible distribution of the xs: 

v X x V x v x V v x 

box 1 box 2 box 3 box 4 box 5 box 6 

Initially we put one x in boxes 2, 3, 4, 5. Then we distribute the remaining four xs 
into the six boxes without constraint — C( 4 + 6 — 1,4)= 126 ways. This counts all 
possible patterns of xs and us (the positions of the us are forced by the way we choose 
the xs). 

Next in each pattern, we arrange a, e, i, o, u in 5! = 120 ways among the five 
positions with us. In total, there are 120 x 126 = 15,120 arrangements. ■ 


We close this section with a table summarizing the different basic types of count- 
ing problems we have encountered in this chapter. 


Ways to Arrange, Select, or Distribute r Objects from n 
Items or into n Boxes 


Arrangement 

Combination 

( Ordered Outcome) 

(Unordered Outcome) 

or 

or 

Distribution of 

Distribution of 

Distinct Objects 

Identical Objects 


No repetition Pin, r ) C(n, r) 

Unlimited repetition n r C(n + r — 1, r) 

Restricted repetition P (n ; r \ , r 2 , . . . r m ) — 



5.4 EXERCISES 


Summary of Exercises Be careful to distinguish whether a prob- 
lem involves distinct or identical objects. Exercises 3 1-34 involve problem restate- 
ment (no actual numerical answers are sought). Exercise 51 presents an important 
alternative approach for analyzing nonconsecutivity problems. 


1. How many ways are there to distribute 27 identical jelly beans among three 
children: 

(a) Without restrictions? 

(b) With each child getting nine beans? 

(c) With each child getting at least one bean? 
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2. How many ways are there to distribute 18 different toys among four children? 

(a) Without restrictions? 

(b) If two children get seven toys and two children get two toys? 

3. In a bridge deal, what is the probability that: 

(a) West has four spades, two hearts, four diamonds, and three clubs? 

(b) North and South have four spades. West has three spades, and East has two 
spades? 

(c) One player has all the Aces? 

(d) All players have a (4, 3, 3, 3) division of suits? 

4 . How many ways are there to distribute eight (identical) apples, six oranges, and 
seven pears among three different people 

(a) Without restriction? 

(b) With each person getting at least one pear? 

5. How many ways are there to distribute 18 chocolate doughnuts, 12 cinnamon 
doughnuts, and 14 powdered sugar doughnuts among four school principals if 
each principal demands at least two doughnuts of each kind? 

6. How many distributions of 1 8 different objects into three different boxes are there 
with twice as many objects in one box as in the other two combined? 

7. How many ways are there to arrange the letters in VISITING with no pair of 
consecutive Is? 

8. How many ways are there to arrange 12 identical apples and five different oranges 
in a row so that no two oranges will appear side by side? 

9. (a) How many arrangements of the letters in COMBINATORICS have no con- 

secutive vowels? 

(b) In how many of the arrangements in part (a) do the vowels appear in alpha- 
betical order? 

10 . How many ways are there to arrange the 26 letters of the alphabet so that no pair 
of vowels appear consecutively (Y is considered a consonant)? 

11 . If you flip a coin 18 times and get 14 heads and four tails, what is the probability 
that there is no pair of consecutive tails? 

12 . How many integer solutions are there to x\ + X 2 + X3 + -*4 + *5 = 3 1 with 

(a) x; > 0 (b) xi > 0 (c) x; > i(i = 1, 2, 3, 4, 5) 

13 . How many integer solutions are there to x\ + X2 + X3 = 0 with x,- > —5? 

14 . How many positive integer solutions are there to xi + X2 + X3 + X4 < 50? 

15 . How many ways can a deck of 52 cards be broken up into a collection of unordered 
piles of sizes 

(a) Four piles of 13 cards? 

(b) Three piles of eight cards and four piles of seven cards? 
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16 . Consider the problem of distributing 10 distinct books among three different 
people with each person getting at least one book. Explain why the following 
solution strategy is wrong: first select a book to give to the first person in 10 
ways; then select a book to give to the second person in nine ways; then select 
a book to give to the third person in eight ways; and finally distribute the 
remaining seven books in 7 3 ways. 

17 . Each of 10 employees brings one (distinct) present to an office party. Each 
present is given to a randomly selected employee by Santa (an employee can 
get more than one present). What is the probability that at least two employees 
receive no presents? 

18 . How many ways are there to distribute k balls into n distinct boxes ( k <n) with 
at most one ball in any box if 

(a) The balls are distinct? 

(b) The balls are identical? 

19 . How many ways are there to distribute three different teddy bears and nine 
identical lollipops to four children 

(a) Without restriction? 

(b) With no child getting two or more teddy bears? 

(c) With each child getting three “goodies”? 

20 . Suppose that 30 different computer games and 20 different toys are to be 
distributed among three different bags of Christmas presents. The first bag is 
to have 20 of the computer games. The second bag is to have 15 toys. The third 
bag is to have 15 presents, any mixture of games and toys. How many ways are 
there to distribute these 50 presents among the three bags? 

21 . Suppose a coin is tossed 12 times and there are three heads and nine tails. How 
many such sequences are there in which there are at least five tails in a row? 

22 . How many binary sequences of length 20 are there that 

(a) Start with a run of 0s — that is, a consecutive sequence of (at least) one 0 — 
then a run of Is, then a run of 0s, then a run of Is, and finally finish with a 
run of 0s? 

(b) Repeat part (a) with the constraint that each run is of length at least 2. 

23 . How many binary sequences of length 18 are there that start with a run of 
Is — that is, a consecutive sequence of (at least) one 1 — then a run of 0s, then a 
run of Is, and then a run of 0s, such that one run of Is has length at least 8? 

24 . What fraction of all arrangements of EFFLORESCENCE has consecutive Cs 
and consecutive Fs but no consecutive Es? 

25 . How many arrangements of MISSISSIPPI are there with no pair of consecu- 
tive Ss? 

26. How many ways are there to distribute 15 identical objects into four different 
boxes if the number of objects in box 4 must be a multiple of 3? 
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27. If n distinct objects are distributed randomly into n distinct boxes, what is the 
probability that 

(a) No box is empty? 

(b) Exactly one box is empty? 

(c) Exactly two boxes are empty? 

28. How many ways are there to distribute eight balls into six boxes with the first 
two boxes collectively having at most four balls if 

(a) The balls are identical? 

(b) The balls are distinct? 

29. In Example 4, if all 10 pieces of fruit were distinct, what would be the fraction 
of outcomes with two pieces of fruit in each box? Why is this fraction greater 
when the pieces of fruit are distinct? 

30. (a) How many ways are there to make 35 cents change in 1952 pennies, 1959 

pennies, and 1964 nickels? 

(b) In 1952 pennies, 1959 pennies, 1964 nickels, and 1971 quarters? 

31. State an equivalent distribution version of each of the following arrangement 
problems: 

(a) Arrangements of eight letters chosen from piles of as, bs, and cs 

(b) Arrangements of two as, three bs, four cs 

(c) Arrangements of 10 letters chosen from piles of as, bs, cs, and c/s with the 
same number of as and bs 

(d) Arrangements of four letters chosen from two as, three bs, and four cs 

32. State an equivalent arrangement version of each of the following distribution 
problems: 

(a) Distributions of n distinct objects into n distinct boxes 

(b) Distributions of 15 distinct objects into five distinct boxes with three objects 
in each box 

(c) Distributions of 15 distinct objects into five boxes with i objects in the ith 
box, i= 1, 2, 3, 4, 5 

(d) Distributions of 12 distinct objects into three distinct boxes with at most 
three objects in box 1 and at most five objects in box 2 

33. State an equivalent distribution version and an equivalent integer-solution-of- 
an-equation version of the following selection problems: 

(a) Selections of six ice cream cones from 3 1 flavors 

(b) Selections of five marbles from a group of five reds, four blues, and two 
pinks 

(c) Selections of 12 apples from four types of apples with at least two apples of 
each type 
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(d) Selections of 20 jelly beans from four different types with an even number 
of each type and not more than eight of any one type 

34 . State an equivalent selection version and an equivalent integer-solution-of- 
an-equation version of the following distribution problems: 

(a) Distributions of 30 black chips into five distinct boxes 

(b) Distributions of 18 red balls into six distinct boxes with at least two balls in 
each box 

(c) Distributions of 20 markers into four distinct boxes with the same number 
of markers in the first and second boxes 

35 . How many election outcomes are possible (numbers of votes for different 
candidates) if there are three candidates and 30 voters? If, in addition, some 
candidate receives a majority of the votes? 

36 . How many election outcomes in the race for class president are there if there 
are five candidates and 40 students in the class and 

(a) Every candidate receives at least two votes? 

(b) One candidate receives at most one vote and all the others receive at least 
two votes? 

(c) No candidate receives a majority of the votes? 

(d) Exactly three candidates tie for the most votes? 

37 . How many numbers between 0 and 10,000 have a sum of digits 

(a) Equal to 7? (b) Less than or equal to 7? (c) Equal to 13? 

38 . How many integer solutions are there to the equation x\ + X2 + X3 + X4 < 15 
with X[ > — 10? 

39 . How many nonnegative integer solutions are there to the equation 2xi + 2x2+ 

X3 +X4 = 12? 

40 . How many nonnegative integer solutions are there to the pair of equations 
xi + X2 + • ■ ■ + xg = 20 and x\ + X2 + X3 = 7? 

41 . How many nonnegative integer solutions are there to the inequalities 
jei + X 2 + • — b xg < 20 and xi + X2 + X3 <7? 

42 . How many nonnegative integer solutions are there to xi + X2 H — — H X5 = 20 

(a) With x ; < 10? (b) With x,- < 8? (c) With x, = 2x 2 ? 

43 . How many ways are there to split four red, five blue, and seven black balls among 

(a) Two boxes without restriction? 

(b) Two boxes with no box empty? 

44 . How many ways are there to arrange the letters in UNIVERSALLY so that the 
four vowels appear in two clusters of two consecutive letters with at least two 
consonants between the two clusters? An example of such an arrangement is 
LNUILYSVAER. 
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45. How many ways are arrange the letters in UNIVERSALLY so that no two vowels 
occur consecutively and also the consonants appear in alphabetical order? 

46. How many 8-letter arrangements can be formed from the 26 letters of the 
alphabet (without repetition) that include at most three of the five vowels and in 
which the vowels are nonconsecutive? 

47. How many arrangements of letters in INSTITUTIONAL have all of the 
following properties simultaneously? 

(a) No consecutive Ts 

(b) The 2 Ns are consecutive 

(c) Vowels in alphabetical order 

48. How many arrangements of the letters in INSTRUCTOR have all of the 
following properties simultaneously? 

(a) The vowels appearing in alphabetical order 

(b) At least 2 consonants between each vowel 

(c) Begin or end with the 2 Ts (the Ts are consecutive) 

49. How many arrangements of the letters in STATISTICS have all of the following 
properties simultaneously? 

(a) No consecutive Ss 

(b) Vowels in alphabetical order 

(c) The 3 Ts are consecutive (appear as 3 Ts in a row) 

50. (a) How many arrangements are there of REVISITED with vowels not in 

increasing order — that is, an I before one (or both) of the Es? 

(b) How many arrangements with no consecutive Es and no consecutive Is? 

(c) How many arrangements with vowels not in increasing order and no 
consecutive Es and no consecutive Is? 

51. This exercise presents an alternative approach to counting arrangements with a 
certain type of object not occurring consecutively. Take Exercise 7, which seeks 
arrangements of VISITING with no consecutive Is. Arrange the five consonants 
in 5 ! ways and then pick a subset of three positions (with no repetition) for Is from 
among the six locations before, between, and after the five consonants — done in C 
(6, 3) ways. 

(a) Rework Example 9 using this approach. 

(b) Use this approach to solve Exercise 10. 

52. Among all arrangements of WISCONSIN without any pair of consecutive 
vowels, what fraction have W adjacent to an I? 

53. How many bridge deals are there in which North and South get all the spades? 

54. What is the probability in a bridge deal that each player gets at least three honors 
(an honor is an Ace, or King, or Queen, or Jack)? 
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55. How many ways are there to distribute 15 distinct oranges into three different 
boxes with at most eight oranges in a box? 

56. How many ways are there to distribute 20 toys to m children such that the first 
two children get the same number of toys if 

(a) The toys are identical? (b) The toys are distinct? 

57. How many ways are there to distribute 25 different presents to four people 
(including the boss) at an office party so that the boss receives exactly twice as 
many presents as the second most popular person? 

58. How many ways are there to distribute r identical balls into n distinct boxes with 
exactly m boxes empty? 

59. How many subsets of six integers chosen (without repetition) from 1,2, . . . , 20 
are there with no consecutive integers (e.g., if 5 is in the subset, then 4 and 6 
cannot be in it)? 

60. How many arrangements are there of eight as, six /Is, and seven ys in which 
each a is beside (on at least one side) another a? (Hint: Watch out for two 
clusters of as occurring consecutively.) 

61. How many arrangements are there with n Os and m Is, with k runs of Os? [A run 
is a consecutive set (1 or more) of the same digit; e.g., 000 1 1 10100 has three 
(underlined) runs of 0s.] 

62. What is the probability that a random arrangement of a deck of 52 cards has 
exactly k runs of hearts (see Exercise 61 for a definition of a run)? 

63. How many binary sequences of length n are there that contain exactly m 
occurrences of the pattern 01? 

64. How many ways are there to distribute 20 distinct flags onto 1 2 distinct flagpoles if 

(a) In arranging flags on a flagpole, the order of flags from the ground up makes 
a difference? 

(b) No flagpole is empty and the order on each flagpole is counted? 

65. How many ways are there to distribute r identical balls into n distinct boxes with 
the first m boxes collectively containing at least .v balls? 


66. How many ways are there to deal four cards to each of 13 different players so 
that exactly 1 1 players have a card of each suit? 



5.5 BINOMIAL IDENTITIES 

In this section we show why the numbers C(n, r) are called binomial coefficients. 
Then we present some identities involving binomial coefficients. We use three 
techniques to verify the identities: combinatorial selection models, “block walking,” 
and the binomial expansion. The study of binomial identities is itself a major subfield 
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of combinatorial mathematics and we will necessarily just scratch the surface of this 
topic. 

Consider the polynomial expression (a + x) 3 . Instead of multiplying ( a + x ) by 
(a + x) and the product by (a + x) again, let us formally multiply the three factors 
term by term: 

(i a + x)(a + x)(a + x) = aaa + aax + axa + axx + xaa + xax + xxa + xxx 

Collecting similar terms, we reduce the right-hand side of this expansion to 

a 3 + 3a 2 x + 3ax 2 +x 3 (1) 

The formal expansion of (a + x) 3 was obtained by systematically forming all products 
of a term in the first factor, a orx, times a term in the second factor times a term in the 
third factor. There are two choices for each term in such a product, and so there are 
the 2 x 2 x 2 = 8 formal products obtained above. If we were expanding (a +x) 10 , 
we would obtain 2 10 = 1024 different formal products. 

Now we ask the question, how many of the formal products in the expansion 
of ( a + x) 3 contain k xs and (3 — k) as? This question is equivalent to asking for 
the coefficient of a 3 ~ k x k in (1). Since all possible formal products of as and xs are 
formed, and since formal products are just three-letter sequences of as and xs, we 
are simply asking for the number of all three-letter sequences with k xs and (3 — k ) 
as. The answer is C(3, k) and so the reduced expansion for (a +x) 3 can be written 
as 



By the same argument, we see that the coefficient of a n ~ k x k in (a + x)" will be equal 
to the number of n-letter sequences formed by k xs and (n — k ) as, that is, C(n, k). If 
we set a = 1 , we have the following theorem. 

Binomial Theorem 


(l + x) n = 




X 


n 


Another proof of this expansion, using induction, is given in the Exercises. Just 
as important as the binomial theorem is the equivalence we have established between 
the number of k-subsets of n objects and the coefficient of x k in (1 + x)". We exploit 
this equivalence extensively in the next chapter. 

Let us now consider some basic properties of binomial coefficients. The most 
important identity for binomial coefficients is the symmetry identity 

( n\ n\ ( n \ 

k)~ k\(n — k)\ ~\n-k) 


(2) 
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In words, this identity says that the number of ways to select a subset of k objects out 
of a set of n objects is equal to the number of ways to select a group of n — k of the 
objects to set aside (the objects not in the subset). 

The other fundamental identity is 



This identity can be verified algebraically. We will give a combinatorial argument 
instead. We classify the C(n, k) committees of k people chosen from a set of n people 
into two categories, depending on whether or not the committee contains a given 
person P. If P is not part of the committee, there are C(n — 1, k) ways to form the 
committee from the other n — 1 people. On the other hand, if P is on the committee, 
the problem reduces to choosing the k — 1 remaining members of the committee 
from the other n— 1 people. This can be done C(n — 1, k — 1) ways. Thus C(n, k ) = 
C{n — 1, k) + C(n — 1, k — 1). 

The following example presents another binomial identity that can be verified 
algebraically or by a combinatorial argument. 


Example 1 


Show that 



(4) 


The left-hand side of (4) counts the ways to select a group of k people chosen from a 
set of n people and then to select a subset of m leaders within the group of k people. 
Equivalently, as counted on the right side, we could first select the subset of m leaders 
from the set of n people and then select the remaining k — in members of the group 
from the remaining n — m people. Note the special form of (4) when m — 1 : 



For a fixed integer n, the values of the binomial coefficients C(n, k ) increase as 
k increases as long as k < n/2. Then the values of C(n, k) decrease as k increases for 
k>n / 2 + 1 . To verify this assertion, we observe that the binomial coefficients are 
increasing if and only if the ratio C(n,k)/C(n,k— 1) is greater than 1. 

/ n\ n ! 

\k) _ k\(n-k)\ _ n-k+1 

/ n \ n ! k 

v*-lj (*-l)!(n-*+l)! 

So the C(n, k)s are increasing when (n — k+\)/k>\ox equivalently when n — k + 
1 > k. Solving for k in terms of n, we have k < (n + l)/2. This is the same bound on 
integer values of k as k < n/2. 
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Using (3) and the fact that C(«, 0) = C(n, n) = 1 for all nonnegative n, we can 
recursively build successive rows in the following table of binomial coefficients, called 
Pascal’s triangle. Each number in this table, except the first and last numbers in a 
row, is the sum of the two neighboring numbers in the preceding row. 

Table of binomial coefficients: fcth number in row n is 



k = o 

/ 

n = 0 1 k = 1 

/\/ 

n = 1 1 1 k = 2 

/\/\/ 

n = 2 1 2 1 it = 3 

/\/\/\/ 

n = 3 13 3 1 * = 4 

/\/\/\/\/ 

72 = 4 1 4 6 4 1 

Pascal’s triangle has the following nice combinatorial interpretation, due to 
George Polya. Consider the ways a person can traverse the blocks in the map of 
streets shown in Figure 5.1. The person begins at the top of the network, at the spot 
marked (0, 0), and moves down the network (down the page) making a choice at each 
intersection to go right or left (for simplicity, let “right” be your right as you look at this 
page, not the right of the person moving down the network). We label each street comer 
in the network with a pair of numbers ( n , k), where n indicates the number of blocks 
traversed from (0, 0) and k the number of times the person chose the right branch at 
intersections. Figure 5.1 shows a possible route from the start (0, 0) to the corner (6, 3). 


Start 



Figure 5.1 
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Any route to comer (n, k) can be written as a list of the branches (left or right) 
chosen at the successive corners on the path from (0, 0) to (n, k ). Such a list is just a 
sequence of k Rs (right branches) and (n — k ) Ls (left branches). To go to corner (6, 3) 
following the route shown in Figure 5.1, we have the sequence of turns LLRRRL. 

Let s(n, k) be the number of possible routes from the start (0, 0) to corner (n, k ) 
(moving down in the network). This is the number of sequences of k Rs and in — k) 
Ls, and so s(n, k) — C(n, k ). Another useful interpretation of binomial coefficients is 
the committee selection model, used above to verify Eqs. (3) and (4). 

Let us show how our “block-walking” model for binomial coefficients can be 
used to get an alternate proof of identity (3). At the end of a route from the start to 
corner (n, k), a block walker arrives at (n, k) from either corner (n — 1, k) or corner 
(n— 1, k— 1). For example, to get to corner (6, 3) in Figure 5.1, the person either goes 
to corner (5, 3) and branches left to (6, 3), or goes to corner (5, 2) and branches right 
to (6, 3). Thus, the number of routes from (0, 0) to corner (n, k ) equals the number of 
routes from (0, 0) to (n — 1, k ) plus the number of routes from (0, 0) to (n— 1, £—1). 
We have verified identity (3): s{n, k) — s(n — 1, k) + s{n — 1, k — 1). 

We now list seven well-known binomial identities and verify three of them 
(the others are left as exercises). These binomial identities are of much practical 
interest. Expressions involving sums and products of binomial coefficients arise fre- 
quently in complicated real-world counting problems. These identities can be used 
to simplify such expressions. 



Here C(n, r) = 0 if 0 < n < r. These identities can be explained by the 
“committee” type of combinatorial argument used for Eqs. (2), (3), and (4) or by 
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“block-walking arguments.” We give a committee argument for Eq. (6). Consider the 
following two ways of counting all subsets (of any size) of a set of n people: (a) 
summing the number of subsets of size 0, of size 1 , of size 2, and so on — this yields 
the left-hand side of Eq. (6) — and (b) counting all subsets by whether or not the first 
person is in the subset, whether or not the second person is in the subset, and so 
on — this yields 2x2x2. ..x2 {n times) = 2", the right-hand side of (6). A simpler 
proof of (6) is given at the end of this section. 

In general, a combinatorial argument for proving such identities consists of spec- 
ifying a selection counted by the term on the right side, partitioning the selection into 
subcases, and showing that the terms in the summation count the subcases. 

The advantage of proofs involving block walking is that we can draw pictures 
of the “proof.” The picture shows that all the routes to a certain comer — this amount 
is the right-hand side of the identity — can be decomposed in terms of all routes to 
certain intermediate corners (or blocks) — the sum on the left-hand side of the identity. 


Example 2 

Verify identity (8) by block-walking and committee-selection arguments. 

As an example of this identity, we consider the case where r = 2 and n — 6. The 
corners ( k , 2), k = 2, 3, 4, 5, 6, are marked with a * in Figure 5.1 and corner (7, 3) 
is marked with an o. Observe that the right branches at each starred corner are the 
locations of last possible right branches on routes from the start (0, 0) to corner (7, 3). 
After traversing one of these right branches, there is just one way to continue on to 
corner (7, 3), by making all remaining branches left branches. In general, if we break 
all routes from (0, 0) to (n + 1, r + 1) into subcases based on the corner where the 
last right branch is taken, we obtain identity (8). 

We restate the block-walking model as a committee selection: If the kth turn is 
right, this corresponds to selecting the At h person to be on the committee; if the /cth 
turn is left, the f th person is not chosen. We break the ways to pick r + 1 members of 
a committee from n + 1 people into cases depending on who is the last person chosen: 
the (r + l)st, the ( r + 2)nd, . . . , the (n + l)st. If the (r + k + l)st person is the last 
chosen, then there are C(r + k, r ) ways to pick the first r members of the committee. 
Identity (8) now follows. ■ 


Example 3 

Verify identity (9) by a block-walking argument. 

The number of routes from in, k) to (2 n, n) is equal to the number of routes from 
(0, 0) to (n, n — k), since both trips go a total of n blocks with n — k to the right 
(and k to the left). So the number of ways to go from (0, 0) to (n, k) and then on to 
(2 n, n) is C(m, k) x C(n, n — k). By (2), C(m, n—k)= C(n, k ), and thus the number 
of routes from (0, 0) to (2 n, n) via (n, k) is C(n. k) 1 . Summing over all k — that is, 
over all intermediate corners n blocks from the start — we count all routes from (0, 0) 
to (2«, n). So this sum equals C(2n, n), and identity (9) follows. ■ 
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Now we show how binomial identities can be used to evaluate sums whose terms 
are closely related to binomial coefficients. 


Example 4 


Evaluate the sum 1 x 2 x 3 + 2 x 3 x 4 + • — \- ( n — 2)(n — 1 )n. 

The general term in this sum ( k — 2)(k — 1 )k is equal to P(k, 3) = k\/(k — 3)!. 
Recall that the numbers of /--permutations and of /--selections differ by a factor of r! 
That is, Clk, 3) = k\/(k — 3)!3! = P(k, 3)/3!, or P(k, 3) = 3 \C(k, 3). So the given 
sum can be rewritten as 



By identity (8), this sum equals 3! 


cr 


Example 5 

Evaluate the sum l 2 + 2 2 + 3 2 H h n 2 . 

A strategy for problems whose general term is not a multiple of C(n, k ) or Pin, k ) 
is to decompose the term algebraically into a sum of Pin, k )- type terms. In this case, 
the general term k 2 can be written as k 2 = k(k — 1) + k. So the given sum can be 
rewritten as 

[(lx0)+l] + [(2x 1) + 2] + [(3 x2) + 3] + - + [b(« — 1) + b] 


= [(2 x l) + (3 x 2) + • — h n(n — 1)] + (l + 2 + 3 H — • + n) 



by identity (8). 

Note that as part of Example 5, we showed that 1+2 + 3H — ■ + n — C(n + 
l, 2) = + 1), a result we verify by induction in Example 1 of Appendix A. 2. ■ 


There is another simple way to verify certain identities with binomial coefficients. 
We start with the binomial expansion in the binomial theorem. Then we substitute 
appropriate values for x. The following identities can be obtained from the bino- 
mial expansion: (6) by setting x = l, (13), below by setting x = — 1, and (14) by 
differentiating both sides of the binomial expansion and setting x = 1 . 
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or 



(since the sum of both sides is 2") 


n(\+x) n 1 




and so 


n(l + l)" -1 = n2 n ~ l 





(13') 


(14) 


(14') 



5.5 EXERCISES 


Summary of Exercises Combinatorial identities are a well- 
developed field whose surface was barely scratched in this section. The later problems 
in this exercise set go well beyond the level of examples worked in this section. 


1 . 


2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 


10 . 


(a) Verify identity (3) algebraically (writing out the binomial coefficients in 
factorials). 

(b) Verify identity (4) algebraically. 

(c) Verify that C(n, k)/C(n , k—l) = (n — k + V)/k. 

Verify the following identities by block- walking: 

(a) (6) (b) (7) (c) (10) (d) (11) 


Verify the following identities by a committee-selection model: 

(a) (7) (b) (9) (c) (10) (d) (11) (e) (13) 

Verify the following identities by mathematical induction. [Hint: Use (3)]. 
(a) (3) (b) (5) (c) (6) (d) (7) (e) (13') 

Prove the Binomial theorem by mathematical induction. 

Show that identity (7) can be obtained as a special case of (1 1). 

Show that C(2n, n)+C(2n, n — 1) = \C(2n + 2, n + 1). 

If C(n, 3) + C(n + 3—1, 3)=P(n, 3), find n. 

Show by a combinatorial argument that 



(b) ( n-r ) 


n + r - 
r 


n + r — 1 
2 r 


Show that C(k + m + n, k)C(m + n, m) = (k + m + n)\/(k\m\n\). 
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11. (a) Show that 


(b) Evaluate l 3 + 2 3 + 3 3 H — ■ + n 3 . 

12. (a) Evaluate 12 (k + 1 )k(k — 1). {Hint: Use Example 4.) 

k=0 

(b) Evaluate £(2 + 3A :) 2 . 

k = 0 
n 

(c) Evaluate k(n — k). 

k = 0 

13. (a) Evaluate the sum 


1 + 2 


+ ■■■(&+ 1 ) ( , ) + ••■ + (n + 1 ) 


by breaking this sum into two sums, each of which is an identity in this 
section. 

(b) Evaluate the sum 


14. By setting x equal to the appropriate values in the binomial expansion (or one of 
its derivatives, etc.), evaluate 


(a) S‘-<) 

(e) 


cb) X>-i)(") 

(f) 

s-.o 

,c) £ 2 ‘C) 

(g) 


k=0 


k= o vY 

t a ‘ C) 



show “£C) = C+0 

-( ” Y 


V + V 



m + k 
k 


l ) = ± 


16. Show that 

k= 1 
n— 1 

17. Show that P{m + k, m) = 


n + k 
k 


k = 0 


k=\ 

P(m + n, m + 1) 

( m + 1 ) 


18. (a) Consider a sequence of 2 n distinct people in a line at a cashier. Suppose 
n of the people owe $1 and n of the people are due a $1 payment. Show 
that the number of arrangements in which the cashier never goes into debt 
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19 . 

20 . 


21 . 

22 . 

23 . 

24 . 

25 . 

26 . 
27 . 


28 . 


(i.e., at every stage at least as many people have paid in $1 as were paid 
out $1) is equal to ( 2n ) — ( )J 2 | " 1 ). [Hint: Use a symmetry argument in a block- 
walking type of model to demonstrate a one-to-one correspondence between 
sequences where at some stage the cashier goes (at least) $1 into debt and all 
sequences of 2n people in which n+ 1 of the people are owed $1.] 

(b) Repeat part (a) with the following constraint: if the first person pays $1, 
how many arrangements of the people are possible in which the cashier 
always has a positive amount of money until the last person in line (who is 
owed $1)? 


Show that 


Find the value of k that maximizes 

/ n ^ 

(a) 


(b) 


: 2 n 

= £ 

k=0 

2 n + k 
n 


Evaluate 

jfc=t 

(a) Prove that 


n 

k- 1 


■•• = 2 


+ 4 


2n — k 
n 


(b) What is the value of the sum on each side? 


Evaluate 


n 

k+ 1 


+ • ■■ + (-!)> + D( 

= 1. (Hint: Rewrite as a binomial coefficient with 


n - 1 

Show that £](— 1)* 

k=0 

k alone in the bottom.) 

^ (2 n)\ 

Show that > — r - 

kl 2 (n — k)\ 2 

n 

Give a combinatorial argument to evaluate 

k = 0 

(a) For any given k, show that an integer n can be represented as 


n < m. 


+ 


m 2 

7 + 1 


+ ••• + 


m k 

k 


where 0 < m | <m 2 < ■ ■ ■ < m k, k> 1. 


(b) Prove that for a given k, the representation in part (a) is unique. [Hint: Use 
the fact that 


Show that ( 
7=0 V 


1 = 

-k- j - 1 
k~j 


n— 1 
k 


n — 2 
k — 1 


■7-1 


n — k 
1 

n +m + k — 1 
k 
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^ P(m, k ) 1 ™ ( n -k 

29. Show that V v 7 = — V 

k^o P(n, k ) (J to\ n ~m 


n+ 1 
n — m + 1 


. m < n. 


30. Show that 


m !f n — k)\ 


n + 1 


n\(m — k)\ n — m + 1 


,m<n. 


31. lfC 2n = 


n + 1 V n 


2 n 


, show that C 2m = ^ C 2k - 2 C 2l 


m—2k' 


32. Show that J] (" ) (" k \ = 2" 


k = 0 


m — k 


k= 1 


m < n. 


33. Consider the problem of three-dimensional block walking. Show by a combina- 
torial argument that P{n ; i — 1, j , k) + P(n; i, j — 1, k)+ P(n; i, j, k — 1) = 
P(n + 1; i, j, k). 

34. Give a combinatorial argument to show that 

{x\ + x 2 -\ 1 -XtT — S Pin ; i\, i 2 , . . . , iOx'^xf ■ ■ -x'f 

where the sum is over all 6 + i 2 — ■ + i k = n, ij > 0. 

35. Show for sums over all i\ + i 2 3 — ■ + 4 - = n, ij > 0, that 

(a) EP(n; i u i 2 , . . . , i k ) = k" 

(b) Siii 2 ■ ■ ■ i k P(n; i \ , i 2 , . . . ,i k )= P(n, k)k n ~ k 



y 5.6 SUMMARY AND REFERENCES 

This chapter introduced the basic formulas and logical reasoning used in arrangement 
and selection problems. There were four formulas for arrangements and selections 
with and without repetition, and there were three principles for composing subprob- 
lems. But these formulas and principles were just the building blocks for constructing 
answers to dozens of examples and hundreds of exercises. These problems required 
a thorough, logical analysis before one could begin to decompose them into tractable 
subproblems. Such logical analysis of possibilities arises often in computer science, 
probability, and operations research. It is the basic methodology of discrete mathe- 
matics and the most important skill for a student to develop in this course. Note that 
such logical reasoning is the very essence of mathematical model building. 

The n ! formula for arrangements was known at least 2,500 years ago (see David 
[2] for more details of the history of combinatorial mathematics). Problems involv- 
ing binomial coefficients and the binomial expansion were mentioned in a primitive 
way in Chinese, Hindu, and Arab works 800 years ago. Pascal’s triangle appears in a 
fourteenth-century work of Shih-Chieh (see text cover). The first appearance of the 
triangle in the West was 200 years later. However, it was not until the end of the 
seventeenth century that Jacob Bernoulli gave a careful proof of the binomial the- 
orem. The examination of probabilities related to gambling by Pascal and Fermat 
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around 1650 was the beginning of modern combinatorial mathematics. The formula 
for selection with repetition was discovered soon afterward in the following context: 
the probability when flipping a coin that the nth head appears after exactly r tails is 
(*y + " C(n + r — 1 , r) (there are n “boxes” before and between the occurrences of the 
n heads for placing the r tails). The formula for arrangements with repetition was 
discovered around 1700 by Leibnitz in connection with the multinomial theorem (see 
Exercise 34 of Section 5.5). Jacques Bernoulli’s Ars Conjectandi (1713) was the first 
book presenting basic combinatorial methods. The first comprehensive textbook on 
permutation and combination problems was written by Whitworth [3] in 1901. 

See General References (at end of text) for a list of other introductory texts on 
enumeration. 

1. R. Buck, Advanced Calculus, 3rd ed., McGraw-Hill, New York, 1978. 

2. F. David, Games, Gods, and Gambling: A History of Probability and Statistical 
Ideas, Dover Press, New York, 1998. 

3. W. Whitworth, Choice and Chance, 5th ed. (1901), Hafner Press, New York, 1901. 

SUPPLEMENT: 

SELECTED SOLUTIONS TO PROBLEMS IN CHAPTER 5 


Section 5.1 

Exercise 13 

(a) How many different 6-digit numbers are there (leading zeros, e.g., 00174, not 
allowed)? 

(b) How many even 6-digit numbers are there? 

(c) How many 6-digit numbers are there with exactly one 3? 

(d) How many 6-digit palindromic numbers (numbers that are the same when the 
order of their digits is inverted, e.g., 137731) are there? 

Answer 

(a) All numbers from 100,000 to 999,999 form the 6-digit numbers. Answer is 
9 x 10 x 10 x 10 x 10 x 10. 

(b) For a number to be even, the rightmost position must have an even digit 
(0 or 2 or 4 or 6 or 8). Answer: 9 x 10 x 10 x 10 x 10 x 5. 

(c) If leftmost digit is 3, there are five numbers. If any other digit is 3, 8 x 9 4 numbers. 
Total is 9 5 + 5 x (8 x 9 4 ). 

(d) A palindromic number must have the same digit in the leftmost and rightmost 
position and similarly for the second-leftmost and second-rightmost positions. 
Then the problem becomes one of counting the ways to pick values for three 
leftmost digits (the three right digits are then forced by palindromic symmetry) — 
9x10x10 ways. 
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Exercise 30 

How many times is the digit 5 written when listing all numbers from 1 to 100,000? 
Answer 

It is simpler to answer this question for numbers between 0 and 99,999 (adding 0 and 
omitting 100,000 makes no difference since neither contains a 5). We thus ask how 
many times 5 appears in writing 5-digit sequences (5-digit numbers with leading 0s 
allowed). Consider the following problem: How many times does a 5 occur in the third 
(middle) position in these 5-digit sequences — that is, how many 5-digit sequences are 
there with a 5 in the third position? The answer is 10 4 . For all five positions in the 
sequence, the answer is 5 x 10 4 . 

Exercise 33 

How many four-digit numbers are there formed from the digits 1, 2, 3, 4, 5 (with 
possible repetition) that are evenly divisible by 4? 

Answer 

A number is divisible by 4 if and only if the number formed by its two rightmost 
digits is divisible by 4. Also to be divisible by 4, the Is digit (the rightmost digit) must 
be even — in this problem, 2 or 4. Observe that composing any given 10s digit with 
the two even Is digits, such as 32, 34, we get two consecutive even numbers. Exactly 
one of every two consecutive even numbers is divisible by 4. So to generate a number 
divisible by 4 using digits 1, 2, 3, 4, 5 there can be any of these digits in the 1000s, 
the 100s and 10s positions followed by one choice for the Is position — 5 3 ways. 
Supplement: Selected Solutions to Problems in Chapter 5 

Section 5.2 

Exercise 49 

What fraction of all arrangements of INSTRUCTOR has 

(a) Three consecutive vowels? 

(b) Two consecutive vowels? 

Answer 

Total number of arrangements is C(10, 2) x C(8, 2) x 6! and so the outcomes in parts 
(a) and (b) will be divided by this amount to get the probabilities of these events. 

(a) Replace the sequence of three consecutive vowels by a special symbol V. Now 
we arrange the eight letters, C, N, R, R, S, T, T, V. We pick a pair (subset of size 2) 
of positions for the two Rs, then pick a pair of (remaining) positions for the Ts, 
and then arrange the four distinct letters — C( 8, 2) x C(6, 2) x 4! arrangements. 
Next there are 3 ! arrangements of the three vowels to put in place of V. Answer: 
3! x C(8, 2) x C(6, 2) x 5! 


Supplement: Selected Solutions to Problems in Chapter 5 239 


(b) Initially proceed as in part (a), now with V as a double vowel and V' as a single 
vowel. There are C(9, 2) x C(7, 2) x 5! arrangements of C, N, R, R, S, T, T, V, V' . 
There are three choices for the vowel to be V' and two arrangements of the remain- 
ing vowels for V. In total, 3 x 2 x C( 9, 2) x C(7, 2) x 5! arrangements. However, 
if V is followed (immediately ) by V' or if V' is followed by V, we obtain three 
vowels in a row. Every arrangement with three vowels in a row will be generated 
in two different ways — e.g., TIOUCRRTNS arises from T(IO)(U)CRRTNS and 
T(I)(OU)CRRTNS. Subtracting arrangements with three vowels in a row [part 
(a)], we have [3 x 2 x C( 9, 2) x C( 7, 2) x 5!} - {3! x C(8, 2) x C( 6, 2) x 4!}. 

Exercise 55 

(a) What is the probability that k is the smallest integer in a subset of four different 
numbers chosen from 1 through 20 (1 < k < 17)? 

(b) What is the probability that k is the second smallest? 

Answer 

(a) As in most counting problems, the key here is to focus on the numbers that 
remain to be chosen, not on what one is told must be in the subset. To be 
concrete, initially assume k = 8. For 8 to be the smallest number in the sub- 
set, the other three numbers in the subset must be larger than 8 — chosen in C 
(20 — 8, 3) ways. For a general k, the probability is C(20 — k, 3)/C(20, 4). 

(b) If 8 is the second smallest number in the subset, there is one smaller number — 
chosen in C( 8 — 1, 1) ways — and two larger numbers — chosen in C(20— 8, 2) 
ways. In general, the probability is C(k — 1, 1) x C(20 — k, 2)/C(20, 4). 

Exercise 59 

What is the probability that two (or more) people in a random group of 25 people 
have a common birthday? 

Answer 

The “trick” in this problem is that it is easier to count the probability that no one 
has a common birthday and subtract this probability from 1. We want the fraction 
of possible birthday dates for 25 people in which everyone has a different birthday. 
The denominator, all possibilities of various birthdays for the 25 (different) people, 
is 365 25 . The numerator, the possibilities where everyone has a different birthday, is 
R(365, 25). The desired probability equals 1 — P(365, 25)/365 25 . (With a calculator 
or computer, one determines this probability to be about .57.) 

Exercise 63 

Given a collection of 2 n objects, n identical and the other n all distinct, how many 
different subcollections of n objects are there? 
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Answer 

The trick is to decide which of the n distinct objects will be in the collection. Any subset 
of distinct objects can be chosen in C(«, 0) + C{n, 1) + C(«, 2) H — • + C(n, n) ways, 
with the remaining elements made up of identical objects — done in one way (since 
the objects are identical). An alternative approach is to say that we have the choice to 
use or not use each distinct object — 2" outcomes. 

Exercise 69 

What is the probability that a random 5-card hand has 

(a) Exactly one pair (no three of a kind or two pairs)? 

(b) One pair or more (three of a kind, two pairs, four of a kind, full house)? 

(c) The cards dealt in order of decreasing value? 

(d) At least one spade, at least one heart, no diamonds or clubs, and the values of the 
spades are all greater than the values of the hearts? 

Answer 

The denominator is C(52, 5). 

(a) There are C(13, 1 ) x C(4, 2) ways to pick a pair (two cards of the same kind). To 
fill out the rest of the hand, pick one card of a second kind — 48 ways — then one 
card of a third kind — 44 ways — and finally one card of a fourth kind — 40 ways. 
The problem is that this rest-of-hand count is ordered — that is, the sequence of 
choices 74k, Qv . 54 yields the same rest-of-hand as 54, Qy, 74k. Divide by 3 ! 
to “un-order” this rest-of-hand, yielding the answer: {C(13, 1) x C( 4, 2) x (48 x 
44 x 40/3!)}/C(52, 5). 

Another approach for the rest-of-hand is to pick a subset of three other kinds 
that will appear in the rest-of-hand — C(12, 3) ways — and pick a card of each of 
these kinds — 4 3 ways, yielding {C(13, 1) x C( 4, 2) x C(12, 3) x 4 3 }/C(52, 5). 

(b) Determine the probability of each of the five cards being of a different kind and 
subtract this probability from 1:1 — C(13, 5) x 4 5 /C(52, 5). 

(c) Any 5-card hand has a 1/5! chance of being dealt in a particular order 
(increasing or otherwise). 

(d) Pick a subset of the five kinds (values) that will appear in the hand — C(13, 5) 
ways. The lowest k (1 < k < 4) kinds must be hearts and the other kinds spades. 
It remains only to pick the value of k — four choices: 4 x C(13, 5). (Wasn’t that 
sneaky!) 

Exercise 83 

(a) How may points of intersection are formed by the chords of an n-gon 
(assuming no three of these lines cross at one point)? 

(b) Into how many line segments are the lines in part (a) cut by the intersection points? 
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(c) Use Euler’s formula r = e — v + 2 and parts (a) and (b) to determine the number 
of regions formed by the chords of an «-gon. 

Answer 

(a) Every subset of four vertices of the n-gon forms a unique intersection point 
generated by the intersection of the two chords joining opposite vertices in the 
subset of four vertices. Thus the answer is C(n, 4). 

(b) The number of chords is C(n , 2) — n (all pairs of vertices are connected by chords 
except for the n edges of the polygon). Each intersection point splits two line 
segments, increasing the number of line segments by 2. Then the answer to part 
(b) is C(n, 2) — n + 2C(n, 4). 

(c) The total number of edges e equals the number of line segments — C(n, 2) — n + 
2 C(n, 4) [from part (b)] — plus the number of edges of the polygon — n edges — 
yielding e = C(n, 2) + 2 C(n, 4). The number of vertices v is the number of 
polygon vertices plus intersection points, v = n + C(n, 4). By Euler’s formula, 
the number r of regions (excluding the infinite region) isr = e — v + 1 = 
[C(n, 2) + 2C(n, 4)] - [n + C(n, 4)] + 1 = C(n, 2) + C(n, 4) - n + 1. 


Exercise 87 

A man has seven friends. How many ways are there to invite a different subset of 
three of these friends for a dinner on seven successive nights such that each pair of 
friends are together at just one dinner? 

Answer 

We need to find all possible unordered collections of seven 3-subsets such that each 
pair of the seven friends appears in exactly one subset. Consider the following table 
of one such collection of 3-subsets (an X means that the element of that row is in the 
subset of that column): 

Subsets 




1 

2 

3 

4 

5 

6 

7 

F 

A 

X 

X 

X 





r 

B 

X 



X 

X 



i 

C 

X 





X 

X 

e 

D 


X 


X 



X 

n 

E 


X 



X 

X 


d 

F 



X 

X 


X 


s 

G 



X 


X 


X 


Let the first three 3-subsets be the ones involving friend A. There are [(®) x (,) x 
(^)]/3 ! = 15 collections of 3-subsets involving A — we count all ways to pair off the 
other six friends into 3-subsets containing A, and divide by 3! so that the pairings 
are not ordered. Let the first subset involve B as well as A. A general collection of 
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three 3-subsets with A has the form (A, B, C'), (A, D', E'), (A, F, G') where C' is the 
other friend in the subset with A and B, D' is the (alphabetically) first of the friends 
not in the first subset, E' is the other friend in the subset with A and D', and F' is 
alphabetically earlier than G'. Replacing C, D, E, F, and G by C\ D', E', F, and G' in 
the above table, the only remaining choice is whether B or C' forms 3-subsets with 
D', F' and E', G' — two choices. So there are 15x2 collections of seven 3-subsets of 
the seven friends such that each pair appears in one 3-subset. 

Each such collection can be arranged over the seven successive nights in 7 ! ways. 
So the answer is 15 x 2 x 7!. 

Section 5.3 

Exercise 27 

How many ways are there to split a group of 2 n as, 2 n (is, and 2 n y s in half 
(into two groups of 3 n letters)? {Note: The halves are unordered; there is no first 
half.) 

Answer 

Count the ways to select 3n letters from the three types of letters and then subtract 
outcomes with 2n + 1 or more of one letter — C(3n + 3—1, 3 n) — 3 x C((n — 1) + 
3 — 1, n — 1). Since each split forms two groups of 3 n letters, it appears we should 
divide this count of 3«-letter groups by 2. However, the split in which each group 
consists of n letters of each type is not double counted. So the answer is j[C(3« + 
3-1, 3/i) - 3 x C((n - 1) + 3 - 1, n - 1) - 1] + 1. 

Exercise 30 

How many arrangements of six Os, five Is, and four 2s are there in which 

(a) The first 0 precedes the first 1? 

(b) The first 0 precedes the first 1, which precedes the first 2? 

Answer 

(a) Position the four 2s among the 15 positions — C(15, 4) ways — then put a 0 in the 
first of the remaining positions — one way — and then pick five other positions for 
the remaining Os — C(10, 5) ways. Answer: C(15, 4) x C(10, 5). 

(b) Put a 0 in the first position — one way — then pick five other positions for the 
remaining Os — C(14, 5) ways — then put a 1 in the first of the remaining 
positions — 1 way — and then pick four other positions for the remaining Is — 
C(8, 4) ways: C(14, 5) x C(8, 4). 

Exercise 3 1 

How many arrangements are there of An letters, four of each of n types of letters, in 
which each letter is beside a similar letter? 
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Answer 

There cannot be exactly three consecutive letters the same, because that would leave 
the fourth letter of that type alone. So there are either two in a row or four in a row. But 
four in a row is the same as two consecutive two-in-a-rows. So our problem reduces 
to counting arrangements of 2 n objects (two-in-a-rows) of n types with 2 of each type. 
There are (2n)!/(2!)” such arrangements. 

Exercise 33 

When a coin is flipped n times, what is the probability that 

(a) The first head comes after exactly m tails? 

(b) The ;th head comes after exactly m tails? 

Answer 

(a) There are 2" outcomes in all. The sequence of flips begins with m successive tails 
followed by a head. The sequence can be completed in 2"~ <m+1> ways. Probability: 

2«-(m+l) ^2" = 2~( m + 1 > 

(b) If the ith head comes after exactly m tails, then the first m + (i — 1 ) flips con- 
tain m tails and i — 1 heads — C(m + (i — 1), m) ways. The remaining flips 
are unrestricted — ways. Probability: C(m + (i — 1), m)2' ,_t " i + ,) /2" = 
C(m + i - 1, /?j)2 _(m+,) . 

Exercise 36 

How many arrangements of five as, five /Is and five y s are there with at least one ft 
and at least one y between each successive pair of as? 

Answer 

There are three cases: 

1. Exactly one ft and exactly one y between each pair of as: between each of the 
four pairs of as, the ft or the y can be first — 2 4 ways. The fifth ft and fifth y along 
with the sequence of the rest of the letters can be considered as three objects to 
be arranged — 3! ways. Altogether, 2 4 x 3! = 96 ways. 

2. Exactly one ft between each pair of as and two y s between some pair of as (or 
two fts between some pair of as and exactly one y between each pair of as): 
there are four choices for between which pair of as the two y s go and three ways 
to arrange the two ys and one ft there. There are 2' choices for whether the ft or 
the y goes first between the other three pairs of as and two choices for at which 
end of the arrangement the fifth ft goes. Multiplying by 2 for the case of two fts 
between some pair of as, we obtain 2 x (4 x 3 x 2? x 2) = 384 ways. 

3. Two ft s between some pair of as and two ys between some pair of as: There are 
two subcases. If the two fts and two y s are between the same pair of as, there are 
four choices for which pair of as, C( 4, 2) ways to arrange them between this pair 
of as, and 2 3 choices for whether the ft or the y goes first between the other three 
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pairs of as. If two /is and two y s are between the different pairs of as, there are 
4x3 ways to pick between which as the two /is and then between which as the 
two y s go, 3 2 ways to arrange the two ys and one /i and to arrange the one y and 
two /is, and 2 2 choices for whether the /i or the y goes first between the other 
two pairs of as. Together, 4 x C( 4, 2) x 2 3 + 4 x 3 x 3 2 x 2 2 = 1056 ways. 

All together, the three cases give us a total of 96 + 384 + 1056 = 1536 arrangements. 

Section 5.4 

Exercise 27 

If n distinct objects are distributed randomly into n distinct boxes, what is the proba- 
bility that 

(a) No box is empty? 

(b) Exactly one box is empty? 

(c) Exactly two boxes are empty? 

Answer 

(a) n\/n n . 

(b) Pick which box is empty — n choices — then which other box gets two objects — 
n — 1 choices — then which two objects go into this box — C(n, 2) choices — and 
then distribute the remaining n — 2 objects into the remaining n — 2 boxes, one 
per box — (n — 2)! ways. Probability: n x (n — 1) x C(n, 2) x (n — 2 )\/n". 

(c) Pick which two boxes are empty — C{n, 2) choices. Two cases: Case (i). One 
box has three objects: pick the box with three objects — n — two choices — 
then pick which three objects go in this box — C(n, 3) choices — and then 
distribute the remaining n — three objects into the remaining n — three 
boxes — ( n — 3)! ways. Case (ii). Two boxes which each have two ob- 
jects: pick the two boxes with two objects — C(n — 2, 2) choices — then 
pick which two objects go into the first 2-object box and which two ob- 
jects go into the second 2-object box — C(n, 2) x C(n — 2, 2) choices — 
and then distribute the remaining n — four objects into the remaining 
n — 4 boxes — (n — 4)! ways. The probability is C(n, 2) x {(« — 2) x C(n, 3) x 
(n - 3)! + Cin - 2, 2) x C(n , 2) x C(n - 2, 2) x (n - 4 )!}/«". 


Exercise 28 

How many ways are there to distribute eight balls into six boxes with the first two 
boxes collectively having at most four balls if 

(a) The balls are identical? 

(b) The balls are distinct? 
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Answer 

Break into five cases of zero or one or two or three or four balls in first two boxes. 

4 

(a) £ C(k + 2- 1,/fc) x C(8-£ + 4- 1, 8 -it). 

k=0 

4 

(b) ^2 C(8, k) x 2 k x 4^~ k : if k distinct balls are in the first two boxes, pick which k 

k = o 

balls — C(8, k) ways — then decide for each ball into which of the first two boxes it 
goes — 2 k ways — and then distribute remaining 8 — k distinct balls into the other 
four boxes — 4 s ~ k ways. 

Exercise 47 

How many arrangements of the letters in INSTITUTIONAL have all of the following 
properties simultaneously? 

(a) No consecutive Ts 

(b) The 2 Ns are consecutive 

(c) Vowels in alphabetical order 
Answer 

First, we handle the constraint of consecutive N’s by gluing the two Ns together into 
one letter. Next there are C((9— 2) + 4—1, (9—2)) = C(10, 7) patterns of three Ts and 
nine non-Ts (including the double N as a single letter) with no consecutive Ts. There 
remains the subproblem of ordering the nine non-Ts in the nine positions chosen for 
them. There are several approaches possible. We use the following strategy. First place 
the double N in the ordering — nine choices — then place the L — eight choices — then 
place the S — seven choices. Now there is just one way to put the six vowels in the 
remaining six places in alphabetical order. The total answer is C(10, 7)x 9x8x7. 

Exercise 52 

Among all arrangements of WISCONSIN without any pair of consecutive vowels, 
what fraction have W adjacent to an I? 

Answer 

Arrangements of WISCONSIN without any pair of consecutive vowels: 
C[(6 — 2) + 4 — 1, (6 — 2)] = C(7, 4) patterns of vowels and consonants with- 
out consecutive vowels; three ways to distribute I, I, O among three vowel positions; 
6!/2!2! ways to distribute consonants among consonant positions. Denominator in 
fraction is then C(7, 4) x 3 x 6!/2!2! 

For the numerator, we look at three cases: 

1. W is beside an I that is the first vowel. If W is just before the I (glue W and I 
together), there are C((5 — 2) + 4 — 1 , (5 — 2)) = C(6, 3) patterns for distributing 
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the other consonants to assure no consecutive vowels, whereas if W is just after I, 
there are C((5 — 1) + 4 — 1, (5 — 1)) = C(7, 4) patterns. In either case, there are 
two ways to order the other vowels and 5!/2!2! ways to arrange the other 
consonants. 

2. If W is beside an I that is the last vowel, the number of outcomes is the same as 
in case (a). 

3. If W is beside an I that is the middle vowel, there are C((5 — 1) + 4 — 1 , (5 — 1)) = 
C(7, 4) patterns for distributing the other consonants to assure no consecutive 
vowels whether W is just before or just after the I — two choices for W. Again 
the other vowels and other consonants can be placed in 2 x 5!/2!2! ways. We 
must subtract the arrangements with the subsequence IWI (that is, W is the only 
consonant between two Is): C((5 — 1) + 3 — 1, (5 — 1)) x 2 x 5!/2!2! In total, 
the numerator is [2 x C( 6, 3) + 4 x C( 7, 4) - C( 6, 4)] x 2 x 5!/2!2! 

Exercise 59 

How many subsets of six integers chosen (without repetition) from 1,2, . . . , 20 are 
there with no consecutive integers (e.g., if 5 is in the subset, then 4 and 6 cannot be 
in it)? 

Answer 

Form a binary sequence of length 20 with six 1 s and 1 4 Os to represent which integers 
are in the subset (a 1 in the /th position means that i is in the subset). In this form, we 
seek all 20-digit binary sequences with six nonconsecutive Is. C((14 — 5) + 7 — 1, 
(14-5)). 

Exercise 61 

How many arrangements are there of n Os and m Is with k runs of Os? A run is a 
consecutive set (1 or more) of the same digit; e.g., 000 1 1 10100 has three (underlined) 
runs of 0s. 

Answer 

Represent a run no matter what its length as an R. Then we arrange m Is and k Rs 
with no consecutive Rs. Using the reasoning in Exercise 59, there are C(m + 1, k) 
arrangements. Next pick how many 0s there are in each run. We select distribute 
the n 0s into k boxes (runs) with at least one 0 in each box — C(n — 1, k — 1) ways: 
C(m + 1, &) x C(n — l, k — 1). 

Exercise 64 

How many ways are there to distribute 20 distinct flags onto 12 distinct flagpoles if 

(a) In arranging flags on a flagpole, the order of flags from the ground up makes a 
difference? 

(b) No flagpole is empty and the order on each flagpole is counted? 
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Answer 

(a) Lay the flagpoles on the ground end-to-end with a slash (|) between flag- 
poles. Then we need to count all arrangements of the 20 distinct flags and 
(12— l)]s — 311/11! ways. 

(b) First we distribute 20 identical flags into 12 flagpoles with no flagpole empty in 
C(20— 1, 12— 1) ways. Now as in part (a), lay the flagpoles end to end. Then 
arrange the 20 distinct flags among the 20 places where there are (identical) flags 
in 20! ways: C(20-l, 12 — 1) x 20! 
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CHAPTER 6 

GENERATING 

FUNCTIONS 



GENERATING FUNCTION MODELS 


In this chapter, we introduce the concept of a generating function. Generating func- 
tions are developed in this chapter to handle special constraints in selection and 
arrangement problems with repetition. They are used in Chapters 7,8, and 9 to solve 
other combinatorial problems. Generating functions are one of the most abstract 
problem-solving techniques introduced in this text. But once understood, they are 
also the easiest way to solve a broad spectrum of combinatorial problems. 

Suppose a r is the number of ways to select r objects in a certain procedure. Then 
g(x ) is a generating function for a r if g(x) has the polynomial expansion 

g(x) — ao + a\x + a 2 X 2 + • — h a r x r + • — h a„x n 


If the function has an infinite number of terms, it is called a power series. 
In Section 5.5 we verified the well-known binomial expansion 


(l+x) n = 1 + 






x 


n 


Then g(x) — (1 + x)" is the generating function fora, =C(n, r). the number of ways 
to select an r-subset from an n-set. Recall that we derived the expansion of (1 + x)" 
by first considering the formal multiplication of ( a + x) 3 : 


(a + x){a + x)(a + x) = aaa + aax + axa + axx + xaa 
+ xax + xxa + xxx 


When a = 1 , we obtained 

(1 + x)(l + x)(l + x) = 1 1 1 + 1 lx + lxl + lxx + xl 1 

+ X lx + XX 1 + xxx ( 1 ) 

Such a formal expansion lists all ways of multiplying a term in the first factor times a 
term in the second factor times a term in the third factor. The problem of determining 
the coefficient of x r in ( 1 + x) 3 , and more generally in (1 + x) ,! , reduces to the problem 
of counting the number of different formal products with r xs and (n — r) Is. Such 
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formal products are all sequences of r xs and (n — r) Is. So the coefficient of x r in 
(1 + x) 3 is C( 3, r), and in (1 + x) n is C(n, r). 

It is very important that multiplication in a product of several polynomial factors 
be viewed as generating the collection of all formal products obtained by multiplying 
together a term from each polynomial factor. If the ;th polynomial factor contains r, 
different terms and there are n factors, then there will he r\ x t '2 x r; x • ■ • x r„ differ- 
ent formal products. For example, there will be 2" formal products in the expansion 
of (l+Jt)". 

In the expansion of (1 + x + x 2 ) 4 , the set of all formal products will be sequences 
of the form 



that is, a 1 or an x or an x 2 in each of the entries in the product, such as xlx 2 x. 

In this chapter we are primarily concerned with multiplying polynomial factors 
in which the powers of x in each factor have coefficient 1, factors such as (1 + x + 
x 2 + x 3 ) or (1 + x 2 + x 4 + x 6 H — ■)■ These factors are completely specified by the 
set of different exponents of x. Note that 1 = x°. Thus expansion (1) can be rewritten 


/ 0 , 1\/ 0 , K, 0 , K 0 0 0, 0 0 1, 0 1 0, 0 1 1, 1 0 0 

(x + x )(x +x)(x +x) = xxx +XXX + X X X +XXX +XXX 


x 1 x°x 1 + x'x'x 0 + x'x'x 1 


And the formal products in expansion (2) can be written as 

J or x ei x e2 x e, x ei 0 < e, < 2 (3) 

The problem of determining the coefficient of x r when we multiply several 
such polynomial factors together can be restated in terms of exponents. Consider 
the coefficient of x 5 in the expansion of (1 +x +x 2 ) 4 . It is the number of different 
formal products, such as x 2 x°x 2 x', fonned in expansion (3) whose sum of exponents 
is 5. Determining the coefficient of x 5 in (1 + x + x 2 ) 4 can be modeled as an integer- 
solution-to-an-equation problem (Example 6 of Section 5.4). The number of formal 
products x ei x e2 x e, x ei 0 < e,- < 2 equaling x 5 is the same as the number of integer 
solutions among the exponents to 



e\ + e2 + e-i + e\ — 5 0 < e,- < 2 

According to the equivalent forms of selection-with-repetition discussed in Sec- 
tion 5.4, the preceding integer-solution-to-an-equation problem is equivalent to the 
problem of selecting five objects from a collection of four types, with at most two 
objects of each type. It is also equivalent to the problem of distributing five identical 
objects into four distinct boxes with at most two objects in each box. 
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More generally, the coefficient of x r in (1 + x + x 2 ) 4 — that is, the number of 
formal products x e ' x ei x e3 x ei 0 < e ( - < 2 equaling x r — will be the number of integer 
solutions among the exponents to 

e\ + e 2 + ez + e\ — r 0 < e, <2 

This problem in turn equals the number of ways of selecting r objects from four types 
with at most two of each type (or of distributing r identical objects into four boxes 
with at most two objects in any box). Thus (1 + x +x 2 ) 4 is the generating function 
for a r , the number of ways to select r objects from four types with at most two of 
each type (or to perform the equivalent distribution). 

At this stage, we are concerned only with how to build generating function 
models for counting problems. In the next section we will see how various algebraic 
manipulations of generating functions permit us to evaluate desired coefficients. 

We have shown how the coefficients of (1 + x + x 2 ) 4 can be interpreted as the so- 
lutions to a certain selection-with-repetition or distribution-of-identical-objects prob- 
lem. This line of reasoning can be reversed: given a certain selection-with-repetition 
or distribution problem, we can build a generating function whose coefficients are the 
answers to this problem. We now give some examples of how to build such generating 
functions. We use the intermediate model of an integer-solution-to-an-equation to aid 
in the construction of generating function models. 


Example 1 

Find the generating function for a r , the number of ways to select r balls from three 
green, three white, three blue, and three gold balls. 

This selection problem can be modeled as the number of integer solutions to 

e\ + e 2 + e?, + es, — r 0 < e,- < 3 

Here e\ represents the number of green balls chosen, ei the number of white, <23 blue, 
and e4 gold. To be more concrete, suppose r — 6. Then the integer equation model is 

e \ + £2 + e 3 + e 4 — 6 0 < e, < 3 

We want to construct a product of polynomial factors such that when multiplied 
out formally (as described above), we obtain all products of the form x e, x e2 x e3 x e4 , 
with each exponent e ,■ between 0 and 3. In the case r = 6 a possible formal product 
would be x 2 x°x'x 3 . Then we need four factors, and each factor should consist of an 
“inventory” of the powers of x from which e,- is chosen. That is, each factor should be 
(x° + x 1 + x 2 + x 3 ). The desired generating function is thus (x° + x 1 + x 2 + x 3 ) 4 or 
(1 +x +x 2 + x 3 ) 4 . ■ 


Example 2 

Find a generating function for the number of ways to select r doughnuts from five 
chocolate, five strawberry, three lemon, and three cherry doughnuts. Repeat with the 
additional constraint that there must be at least one of each type. 
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The initial selection problem can be modeled as the number of integer solutions 
to 

e\ + <?2 + 63 + e4 = r, 0<ei,e2<5, 0 <e^,e4<3 

Here e\ represents the number of chocolate doughnuts selected, e2 the number of 
strawberry doughnuts, e 3 the number of lemon doughnuts, and 64 the number of 
cherry doughnuts. We want to construct a product of polynomial factors such that 
when multiplied out formally, we obtain all products of the form x ei x e2 x e3 x ei with 
each et bounded as in the integer solutions model. For ei and e-i the factor is (x° + 
x 1 + x 2 + x 3 + x 4 + x 5 ). For es and e^, the factor is (x° + x 1 + x 2 + x 3 ). Then the 
required generating function is (x° + x 1 + x 2 + x 3 + x 4 + x 5 ) 2 (x° + x 1 + x 2 + x 3 ) 2 . 

When the additional constraint is added of at least one doughnut of each type, 
the integer solution model becomes 

e\ + e2 + ej + £4 = r, l<«i,e2<5, 1 < £3, ^4 < 3 

Then the polynomial factor for e\ and ei becomes (x 1 + x 2 + x 3 + x 4 + x 5 ) and for e 3 
and e4 becomes (x 1 + x 2 + x 3 ). The required generating function is (x 1 + x 2 + x 3 + 
x 4 + x 5 ) 2 (x' +x 2 +x 3 ) 2 . ■ 

Example 3 

Use a generating function to model the problem of counting all selections of six 
objects chosen from three types of objects with repetition of up to four objects of 
each type. Also model the problem with unlimited repetition. 

This selection problem can be modeled as the number of integer solutions to 

e\ + e2 + e3 = 6 0 < e, < 4 

This problem does not ask for the general solution of the ways to select r objects. How- 
ever, in building a generating function, we automatically model all values of r, not just 
r = 6. Wanting a solution to the problem for six objects means that we are interested 
only in the coefficient of x 6 — that is, the ways x ei x e2 x C3 can equal x 6 . We build a 
generating function with a factor of (1 + x + x 2 + x 3 + x 4 ) for each x e ‘ . The desired 
generating function is (1 + x + x 2 + x 3 + x 4 ) 3 , and we want the coefficient of x 6 in it. 

Permitting unlimited repetition means that any number can be chosen of each 
type and so any exponent value is possible (although in this particular problem no 
exponent can exceed 6). From this point of view, the answer is the coefficient of 
x 6 in (1 + x + x 2 + x 3 + ■ • -) 3 , where “ + • • •” means that the factor is an infinite 
series. ■ 

In the unlimited repetition case above, we could have used the generating function 
(1 + x + x 2 + x 3 + x 4 + x 5 +x 6 ) 3 , since we wanted only the coefficient ofx 6 and thus 
greater powers of x could not be used. However, we shall see in the next section that 
it is easier to use infinite series in generating functions. In selection-with-repetition 
problems, when we ask for six objects from three types with unlimited repetition, 
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we do not add the constraint of at most six of any type — it is implicit in the problem 
and there is no need to make it explicit. The same situation applies with generating 
functions. 


Example 4 

Find a generating function for a r , the number of ways to distribute r identical objects 
into five distinct boxes with an even number of objects not exceeding 10 in the first 
two boxes and between three and five in the other boxes. 

While the constraints may be a bit contrived in this example, it is still easy to 
model the problem as an integer-solution-to-an-equation problem with appropriate 
constraints. We want to count all integer solutions to 


e 1; e2even 0<ei,e2<10, 

3 < e 3 , e 4 , e$ < 5 


ei+e 2 + e 3 + e 4 + e 5 = r 


To generate all formal products of the form x ei x e2 x e3 x e4 x e5 , with the given constraints 
on the e,s, we need a product of five factors, each containing the inventory of the 
powers of x permitted for its x e ‘ . For example, the inventory for x e ' is (1 + x 2 + x 4 + 
x 6 + x 8 + x 10 ). The required generating function is g(x) — ( 1 + x 2 + x 4 + x 6 + x 8 + 



With a little practice, generating function models become simple to build. How- 
ever, the concept behind generating functions is far from simple. In the previous 
chapter, we solved similar selection and distribution problems with explicit formu- 
las involving binomial coefficients. Now we are modeling these problems as some 
coefficient, which represents the number of formal products in the multiplication of 
certain polynomial factors. All we write down is the polynomial factors. 

A generating function cannot be designed to model a selection or distribution 
problem for just one amount — say, six objects. It must model the problem for all 
possible numbers of objects. A generating function model stores all the necessary 
information about these subproblems in one function. 

In the next section, determining the value of a specified coefficient of a generating 
function will be reduced to a series of rote algebraic operations. The fact that the 
function models a complex selection or distribution problem will be irrelevant. The 
combinatorial reasoning arises solely in the construction of generating functions. 


6.1 EXERCISES 



' Summary of Exercises The first 2 1 exercises involve simple gen- 
erating function modeling. The remaining problems involve more challenging mod- 
eling; Exercises 25-29 use multinomial generating functions. 
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1. For each of the following expressions, list the set of all formal products in which 
the exponents sum to 4. 

(a) (l+x + x 3 ) 2 (l+*) 2 

(b) ( 1 + x + x 2 + x 3 + x 4 ) 2 

(c) (l+x 3 +x 4 ) 2 (l+x + x 2 ) 2 

(d) (1 + x + x 2 + x i -\ ) 3 

2. B uild a generating function for a r , the number of integer solutions to the following 
equations: 

(a) ei+e 2 + e 3 +e 4 + e 5 — r, 0 < e, < 5 

(b) e\+ ei + e 3 = r, 0 < e, < 6 

(c) e\ + e 2 + e 3 + e 4 = r, 2 < a < 7 e\ even, e 2 odd 

(d) e i + e 2 + + e 4 = r, 0 < e,- 

(e) e l +e 2 + e 3 + e 4 = r, 0<e e 2 , e 4 odd, ^4 < 3 

3. Build a generating function for a r , the number of r selections from 

(a) Five red, five black, and four white balls 

(b) Five jelly beans, five licorice sticks, eight lollipops with at least one of each 
type of candy 

(c) Unlimited amounts of pennies, nickels, dimes, and quarters 

(d) Six types of lightbulbs with an odd number of the first and second types 

4. Build a generating function for a r , the number of distributions of r identical 
objects into 

(a) Five different boxes with at most three objects in each box 

(b) Three different boxes with between three and six objects in each box 

(c) Six different boxes with at least one object in each box 

(d) Three different boxes with at most five objects in the first box 

5. Use a generating function for modeling the number of 5-combinations of the 
letters M, A, T, H in which M and A can appear any number of times but T 
and FI appear at most once. Which coefficient in this generating function do we 
want? 

6. Use a generating function for modeling the number of different selections of r 
hot dogs when there are four types of hot dogs. 

7. Use a generating function for modeling the number of distributions of 16 choco- 
late bunny rabbits into four Easter baskets with at least three rabbits in each 
basket. Which coefficient do we want? 

8. (a) Use a generating function for modeling the number of different election 

outcomes in an election for class president if 25 students are voting among 
four candidates. Which coefficient do we want? 
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(b) Suppose each student who is a candidate votes for herself or himself. Now 
what is the generating function and the required coefficient? 

(c) Suppose no candidate receives a majority of the vote. Repeat part (a). 

9. Find a generating function for a r , the number of /--combinations of an //-set with 
repetition. 

10. Given one each of u types of candy, two each of v types of candy, and three each 
of w types of candy, find a generating function for the number of ways to select 
r candies. 

11. Find a generating function for c//.. the number of ^-combinations of n types of 
objects with an even number of the first type, an odd number of the second type, 
and any amount of the other types. 

12. Find a generating function for a r , the number of ways to distribute r identical 
objects into q distinct boxes with an odd number between r\ and ,V| in the first 
box, an even number between r 2 and ,v 2 in the second box, and at most three in 
the other boxes. Note that r\ and .v i are assumed to be odd numbers; r 2 and ,v 2 are 
assumed to be even numbers. 

13. Find a generating function for a r , the number of ways n distinct dice can show a 
sum of r. 

14. Find a generating function for a r , the number of ways a roll of six distinct dice 
can show a sum of r if 

(a) The first three dice are odd and the second three even 

(b) The ;th die does not show a value of i 

15. Build a generating function for a r , the number of integer solutions to e \ + e 2 + 
(?3 + e 4 = r, —3 < e j < 3. 

16. Find a generating function for the number of integers between 0 and 999,999 
whose sum of digits is r. 

17. Find a generating function for the number of selections of r sticks of chewing 
gum chosen from eight flavors if each flavor comes in packets of five sticks. 

18. (a) Use a generating function for modeling the number of distributions of 20 iden- 

tical balls into five distinct boxes if each box has between two and seven balls. 

(b) Factor out an x 2 from each polynomial factor in part (a). Interpret this 
revised generating function combinatorially. 

19. Use a generating function for modeling the number of ways to select five integers 
from 1,2, ...,«, no two of which are consecutive. Which coefficient do we want 
for n = 20? For a general n! 

20. Explain why (I + x + x 2 H — • + x r ) 4 is not a proper generating function for a r , 
the number of ways to select r objects from four types with repetition. What is 
the correct generating function? 

21. Explain why (l+x + x 2 + x 3 + x 4 ) r is not a proper generating function for the 
number of ways to distribute r jelly beans among r children with no child getting 
more than four jelly beans. 
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22. Show that the generating function for the number of integer solutions to 

e\ + ^2 + e-} + e 4 — r, 0 < e\ < e 2 < e-$ < e \ , is 

(l+x+x~ + -- -)(l-t-x‘'-|-x 4 + ■••) 

(l+x 3 -t-x 6 + ---)U+x 4 + x 8 + ---) 

23. Find a generating function for the number of ways to make r cents change in 
pennies, nickels, and dimes. 

24. A national singing contest has five distinct entrants from each state. Use a gen- 
erating function for modeling the number of ways to pick 20 semifinalists if 

(a) There is at most one person from each state. 

(b) There are at most three people from each state. 

25. Find a generating function g(x, y) whose coefficient of x r y s is the number of 
ways to distribute r chocolate bars and 5 lollipops among five children such that 
no child gets more than three lollipops. 

26. (a) Find a generating function g(x, y, z) whose coefficient x r y s z' is the number 

of ways to distribute r red balls, 5 blue balls, and t green balls to n people 
with between three and six balls of each type to each person. 

(b) Suppose also that each person gets at least as many red balls as blues. 

(c) Suppose also that the first three people get equal numbers of reds and blues. 

(d) Suppose also that no one gets the same number of green and red balls. 

27. Find a generating function g(x, y, z) whose coefficient of x r y s z t is the number 
of ways eight people can each pick two different fruits from a bowl of apples, 
oranges, and bananas for a total of r apples, s oranges, and t bananas. 

28. Find a generating function (jq , , . . . , x ,„ ) whose coefficient of x\' x -^ . . . x r ™ is 

the number of ways n people can pick a total of q chairs of type 1, q chairs of 
type 2, . . . r m chairs of type m if 

(a) Each person picks one chair 

(b) Each person picks either two chairs of one type or no chairs at all 

(c) Person i picks up to i chairs of exactly one type 


29. If g(xi, X2 , . . . , x p ) = (xi +X2 -\ h x p )'\ p > n, how many terms of 

g(xi , . . . , Xp)’s expansion have no exponent of any x ; greater than 1? What 
is the coefficient of one of these terms? 



6.2 CALCULATING COEFFICIENTS 
OF GENERATING FUNCTIONS 

We now develop algebraic techniques for calculating the coefficients of generating 
functions. All these methods seek to reduce a complex generating function to a simple 
binomial-type generating function or a product of binomial -type generating functions. 
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For easy reference, we list in Table 6.1 all the polynomial identities and expansions 
to be used in this section. 

The rule for multiplication of generating functions in (6) is simply the stan- 
dard formula for polynomial multiplication. Identity (1) can be verified by poly- 
nomial “long division.” We restate it, multiplying both sides of (1) by (1 — x), as 
(1 — x m+1 ) = (1 — jc)( 1 + x + x 2 H — ■ + x m ). We verify that the product of the right- 
hand side is 1 — x' n+l by “long multiplication.” 

1 -{- x -{- x 1 T * * ■ T x m 

1 — x 

lT.v+.rH — ■ + x' n (*) 


1 -x m+l 

If m is made infinitely large, so that 1 + x + x 2 + ■ — f- x m becomes the infinite 
series 1 + x + x 2 + • ■ -, then the multiplication process (*) will yield a power series in 
which the coefficient of each x k , k > 0, is zero [the reader can confirm this in (*)]. We 
conclude that (1 — x)( 1 + x + x 2 + ■ ■ •) = 1 . [Analytically, this equation is valid for 
\x\ < 1; the “remainder” term x m+l in (*) goes to 0 as m becomes infinite.] Dividing 
both sides of this equation by (1 — x) yields identity (2). 

Expansion (3), the binomial expansion, was explained at the start of Section 6.1. 
Expansion (4) is obtained from (3) by expanding (1 + y) n , when y = —x m : 

[1 + (-*")]» = 1 + (")(-*'”)+ (”)(-* m ) 2 
+ - + C) < -'" ),+ - + 


Table 6.1 Polynomial Expansions 
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By identity (2), (1 — x) ", or equivalently (737)", equals 

(1 + x + x 2 + x 3 + • • •)” (7) 

Let us determine the coefficient of x r in (7) by counting the number of formal products 
whose sum of exponents is r. If e,- represents the exponent of the ith term in a formal 
product, then the number of formal products x ei x e2 x e3 . . . x e " whose exponents sum 
to r is the same as the number of integer solutions to the equation 

e\ + e 2 + e 3 H h e n =r e, > 0 

In Example 5 of Section 5.4, we showed that the number of nonnegative integer 
solutions to this equation is C(r + n — 1 , r). Thus the coefficient of x r in (7) is C(r + 
n — 1, r). This verifies expansion (5). 

With formulas (1) to (6) we can determine the coefficients of a variety of gener- 
ating functions: first, perform algebraic manipulations to reduce a given generating 
function to one of the forms (1 + x)", (1 — x m )" , or ( 1 — x) _ ", or a product of two such 
expansions; then use expansions (3) to (5) and the product rule (6) to obtain any desired 
coefficient. We illustrate some common reduction methods in the following examples. 


Example 1 

Find the coefficient of x 16 in (x 2 + x 3 + x 4 + • ■ -) 5 - What is the coefficient of x r ? 

To simplify the expression, we extract x 2 from each polynomial factor and then 
apply identity (2). 

(x 2 +x 3 +x 4 + ---) 5 = [X 2 (l+X + X 2 + ---)] 5 

= x 10 (l+x+x 2 + ---) 5 =xi°— L- 

(1-x) 5 

Thus the coefficient of x 16 in (x 2 + x 3 + x 4 H — -) 5 is the coefficient of x 16 in x 10 
(1 — x)“ 5 . But the coefficient of x 16 in this latter expression will be the coefficient 
of x 6 in (1 — x) -5 [i.e., the x 6 term in (1 — x) -5 is multiplied by x 10 to become the 
x 16 term in x 10 (l — x) -5 ]. From expansion (5), we see that the coefficient of x 6 in 
(1-x)- 5 is C(6 + 5 — 1, 6). 

More generally, the coefficient of x r in x 10 (l — x) -5 equals the coefficient of 
x'-to j n (i _ x )-5 — namely, C((r — 10) + 5 — 1, (r — 10)). ■ 


Observe that (x 2 + x 3 + x 4 + • • -) 5 is the generating function a r , for the number 
of ways to select r objects with repetition from five types with at least two of each 
type. In the last chapter, we solved such a problem by first picking two objects in 
each type — one way — and then counting the ways to select the remaining r — 10 
objects — C((r — 10) + 5 — 1, (r — 10)) ways. In the generating function analysis in 
Example 1, we algebraically picked out an x 2 from each factor for a total of x 10 and 
then found the coefficient of x r_ 10 in ( 1 + x + x 2 H — -) 5 > the generating function for 
selection with unrestricted repetition of r — 10 from five types. 
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The standard algebraic technique of extracting the highest common power of 
x from each factor corresponds to the “trick” used to solve the associated selection 
problem. Such correspondences are a major reason for using generating functions: 
the algebraic techniques automatically do the combinatorial reasoning for us. 


Example 2 

Use generating functions to find the number of ways to collect $15 from 20 distinct 
people if each of the first 19 people can give a dollar (or nothing) and the twentieth 
person can give either $1 or $5 (or nothing). 

This collection problem is equivalent to finding the number of integer solutions 
to x\ +X 2 + ■ — b*i 9 + X 20 = 20 when x, = 0 or 1, i = 1, 2, . . . 19, and X 20 = 0 or 
1 or 5. The generating function for this integer-solution-of-an-equation problem is 
(1 + x) 19 (l + x + x 5 ). We want the coefficient of x 15 . The first part of this generating 
function has the binomial expansion 


(1+x) 19 




19 


If we let /(x) be this first polynomial and let g(x) = 1 + x +x 5 , then we can use 
(6) to calculate the coefficient of x 15 in h(x) = f(x)g(x). Let a r be the coefficient 
of x r in /(x) and b r the coefficient of x r in g(x). We know that a r = ( 19 ) and that 
bQ — b\—b^ — \ (other b , s are zero). 

Then the coefficient of x 15 in h(x ) = /(x)g(x) is, by (6), 


a l 5 b 0 + a 14 b 1 + aub 2 H h a 0 b i5 


which reduces to 


a l 5 b 0 + aubi +a l 0 b 5 


since bo, b \ , bs are the only nonzero coefficients in g(x). Substituting the values of 
the various as and b s in (8), we have 



The answer in Example 2 could be obtained directly by breaking the collection 
problem into three cases depending on how much the twentieth person gives: $0 or $1 
or $5. In each case, the subproblem is counting the ways to pick a subset of the other 19 
people to obtain the rest of the $15. The generating function approach automatically 
breaks the problem into three cases and solves each, doing all the combinatorial 
reasoning for us. 

Example 3 

How many ways are there to distribute 25 identical balls into seven distinct boxes if 
the first box can have no more than 10 balls but any number can go into each of the 
other six boxes? 
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The generating function for the number of ways to distribute r balls into seven 
boxes with at most 10 balls in the first box is 


( 1 + X + x 2 + • • • + x 10 )(l + X + x~ + • ■ -) 6 


/ 1 — x 11 

\ 1 — X 



= (1-X H ) 



7 


using identities (1) and (2). Let /(x) = 1 — x 11 and g(x) = (1 — x) 1 . Using expansion 
(5), we have 


, /1 + 7-H 

g(x ) = (1-x) 7 = 1+ ( 1 |x 


2 + 7-1 


+ • • 


r + 7 - 1 


x' + • 


We want the coefficient of x 25 (25 balls distributed) in h(x ) = /(x)g(x). As in 
Example 2, we need to consider only the terms in the product of the two polynomials 
(1 — x 11 ) and (j4y) that yield an x 25 term. The only nonzero coefficients in /(x) = 
(1 — x 11 ) are «o = 1 and flu = — 1. So the coefficient of x 25 in /(x)g(x) is 


^0^25 T — lx 


25 + 7- 
25 


+ (-l)x 


14 + 7- 
14 


The combinatorial interpretation of the answer in Example 3 is that we count all 
the ways to distribute without restriction the 25 balls into the seven boxes, C(25 + 7 — 
1, 25) ways, and then subtract the distributions that violate the first box constraint, 
that is, distributions with at least 11 balls in the first box, C((25— 1 1) + 7 — 1, 
(25 — 1 1)) (first put 1 1 balls in the first box and then distribute the remaining balls 
arbitrarily). Again, generating functions automatically performed this combinatorial 
reasoning. ■ 


The next example employs all the techniques used in the first three examples to 
solve a problem that cannot be solved by the combinatorial methods of the previous 
chapter. 

Example 4 

How many ways are there to select 25 toys from seven types of toys with between 
two and six of each type? 

The generating function for a r , the number of ways to select r toys from seven 
types with between two and six of each type, is 

(x 2 + x 3 + x 4 + x 5 + x 6 ) 7 

We want the coefficient of x 25 . As in Example 1, we extract x 2 from each factor 
to get 

[x 2 (l + x + x 2 + x 3 +x 4 )] 7 = x 14 (l + x + x 2 + x 3 + x 4 ) 7 
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Now we reduce our problem to finding the coefficient of x 25 14 = x 11 in (1 + x ■ 
x 2 + x 3 + x 4 ) 7 . Using identity (1), we can rewrite this generating function as 


(1 + x + x 1 + x i + x 4 ) 7 — 


1 — x 


5\1 


5x7 


= (l~x 7 ) 


1 


1 —x 


1 — X 

Let fix) — (1 — x 5 ) 7 and g(x) — (1 — x) -7 . By expansions (4) and (5), respectively, 
we have 


/(x) = (l-x 5 ) 7 = l - 


5 


gW = 


1 — X 


= 1 + 


7 

1 

1 + 7-1 
1 

r + 7 - 1 


x 10 - 


1 

2 + 7-1 
2 


x 15 + • 


x 2 + ■ 


To find the coefficient of x 11 , we need to consider only the terms in the product of 
the two polynomials ( 1 — x 5 ) 7 and (yr^) that yield x 1 1 . The only nonzero coefficients 
in f{x) — {l— x 5 ) 7 with a subscript <11 (larger subscripts can be ignored) are 
ao, as, and a\o [see the expansion of /(x) above]. The products involving these three 
coefficients that yield x 1 1 terms are 

a 0 b u + a 5 b 6 + a w b x 


= 1 x 


11+7-1 

11 



6 + 7-1 
6 



x 


1 + 7-1 

1 


The following combinatorial interpretation can be given to the final answer in Ex- 
ample 4. The first term, C( 1 1 + 7 — 1, 11), counts the number of ways to select 1 1 toys 
from seven types of toys with no restriction, where 1 1 is the number of additional toys 
to select after we first pick two toys of each type. The next term, — 7C(6 + 7 — 1,6), 
subtracts seven cases of a violation where we pick at least five additional toys of 
some type (we pick five of some type and then pick 11—5 = 6 more toys from the 7 
types). The final term, C(7, 2)C(l + 7— 1,1) adds back all C( 7, 2) cases where some 
pair of violations occurred — that is, with at least five chosen from a pair of types 
[we pick five of two types and then pick 1 1 — (2 x 5) = 1 more toy from the seven 
types]. The logic behind this combinatorial approach will be developed in Chapter 8. 

We close this section by showing how generating functions can be used to verify 
binomial identities. We express the right side of the identity as a particular coefficient 
in some generating function h(x) and the left side as the same coefficient in a product 
of generating functions fix) and g(x), where h(x) = /(x)g(x). 


Example 5: Binomial Identity 

Verify the binomial identity 
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The right-hand side of the identity is the coefficient of x" in (1 + x) 2n . The left- 
hand side terms involves coefficients in ( 1 + x) n . The product of generating functions 
we want is (1 + x)"(l + x) ,! . That is, let f(x) = g(x) = (1 +x) n so that /(x)g(x ) = 
h(x) = (1 + x) 2n . Then a r = b r — C(n, r). By the product rule (6), the coefficient of 
x" in /(x)g(x) is 


a 0 b„ + a\b n _\ + • • • + a n bo 



Equating coefficients of x n in (1 + x)"(l +x)" = (1 +x) * 1 2 ", we obtain the required 
identity. ■ 


The reader is encouraged to compare the generating function proof of the pre- 
ceding combinatorial identity with the combinatorial proof of the same identity given 
in Example 3 of Section 5.5. 



6.2 EXERCISES 


Summary of Exercises The first 29 exercises are similar to the 
examples of coefficient calculation in this section. Exercises 31-36 involve binomial 
identities (several of these problems are quite tricky). Exercises 38 — 42 and 44 intro- 
duce the topic of probability generating functions (some background in probability 
is helpful). 


1 . Find the coefficient of x 10 in (1 +x +x 2 + x 3 + • ■ ■)". 

2. Find the coefficient of x r in (x 5 + x 6 + x 7 + • ■ -) 8 9 - 

3. Find the coefficient of x 7 in (1 +x 2 + x 4 )(l +x) m . 

4. Find the coefficient of x 16 in (x +x 2 + x 3 + x 4 + x 5 )(x 2 +x 3 +x 4 H — -) 5 - 

5. Find the coefficient of x 17 in (x 2 + x 3 + x 4 +x 5 +x 6 +x 7 ) 3 . 

6. Find the coefficient of x 47 in (x 10 + x 11 + • — fx 25 )(x + x 2 + • — hx 15 ) 

(x 20 H hx 45 ). 

7. Find the coefficient of x 32 in (x 3 + x 4 + x 5 + x 6 + x 7 ) 7 . 

8. Find the coefficient of x 24 in (x + x 2 + x 3 + x 4 + x 5 ) 8 . 

9. Find the coefficient of x 16 in (x + x 2 + x 3 + x 4 + x 5 + x 6 + x 7 ) 4 . 

10. Find the coefficient of x 36 in (x 2 + x 3 + x 4 +x 5 + x 6 +x 7 +x 8 ) 5 . 
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11 . 


12 . 

13 . 

14 . 

15 . 


16 . 

17 . 


18 . 


19 . 

20 . 

21 . 


22 . 


Find the coefficient of x 1 1 in 


x 2 (l — x) _ 10 

(d) 

x 2 — 3x 

(e) 

(1 -x) 4 

(1 -x 2 ) 5 


(1-x) 5 



(b) 

(c) 

Give a formula similar to (1) for 
(a) 1 + x 4 + x 8 H 1- x 24 


(b) 


x -|- 3 

1 — 2x + x 2 
b" l x m 

(1 -bx) m + l 


x 20 + x A0 + ... +x m 


Find the coefficient of x 8 in (x 2 + x 3 + x 4 + x 5 ) 5 . 

Find the coefficient of x ls in (1 + x 3 + x 6 + x 9 + • • • ) 6 . 
Find the coefficient of x 12 in 


(a) (1 — x) 8 (d) (1 — 4x) -5 

(bHl+x)- 1 (e) (1+x 3 )- 4 

(c) (1+x)- 8 

Find the coefficient of x 25 in (1 +x 3 +x 8 ) 10 . 

Use generating functions to find the number of ways to select 10 balls from a 
large pile of red, white, and blue balls if 


(a) The selection has at least two balls of each color 

(b) The selection has at most two red balls 

(c) The selection has an even number of blue balls 


Use generating functions to find the number of ways to distribute r jelly beans 
among eight children if 

(a) Each child gets at least one jelly bean 

(b) Each child gets an even number of beans 

How many ways are there to place an order for 12 chocolate sundaes if there are 
five types of sundaes, and at most four sundaes of one type are allowed? 

How many ways are there to paint the 10 identical rooms in a hotel with five 
colors if at most three rooms can be painted green, at most three painted blue, at 
most three red, and no constraint is laid on the other two colors, black and white? 
How many ways are there to distribute 20 cents to n children and one parent if 
the parent receives either a nickel or a dime and 

(a) The children receive any amounts? 

(b) Each child receives at most 1 <p. 

How many ways are there to get a sum of 25 when 10 distinct dice are rolled? 
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23 . 


24 . 

25 . 


26 . 


27 . 

28 . 
29 . 


30 . 


31 . 


32 . 


33 . 


34 . 

35 . 


How many ways are there to select 300 chocolate candies from seven types of 
candy if each type comes in boxes of 20 and if at least one but not more than 
five boxes of each type are chosen? (Hint: Solve in terms of boxes of chocolate.) 
How many different committees of 40 senators can be formed if the two senators 
from the same state (50 states in all) are considered identical? 

How many ways are there to split six copies of one book, seven copies of a 
second book, and 1 1 copies of a third book between two teachers if each teacher 
gets 12 books and each teacher gets at least two copies of each book? 

How many ways are there to divide five pears, five apples, five doughnuts, five 
lollipops, five chocolate cats, and five candy rocks into two (unordered) piles of 
15 objects each? 

How many ways are there to collect $24 from four children and six adults if 
each person gives at least $ 1 , but each child can give at most $4 and each adult at 
most $7? 

If a coin is flipped 25 times with eight tails occurring, what is the probability 
that no run of six (or more) consecutive heads occurs? 

If 10 steaks and 15 lobsters are distributed among four people, how many ways 
are there to give each person at most five steaks and at most five lobsters? 

Show that (1 — x — x 2 — x 3 — x 4 — x 5 — x 6 ) -1 is the generating function for the 
number of ways a sum of r can occur if a die is rolled any number of times. 

Use generating functions to show that 


E 

k = o 


raw n 
k / \r — k 


m + n 
r 


Use the equation 


to show that 

m/2 


2 \n 


(1 — Xj 

(1 —x)" 


= (! + *)” 




k=0 


n\ (n-\-m — 2k — 1 
n — 1 


m < n and m even 


Use binomial expansions to evaluate 


(a) J2 


k = 0 


raw n 
k)\r + k 


(b) £(-i)* 


k = 0 


n W n 
kj \r — k 


^ fk + 5 
(a) Evaluate > 

k=n\ 


(O 

k = 0 

n — k 
m — k 


(a) Show that I - 


k = o 


IV fn + k- 1 


(b) Evaluate ^ 

k=0 

oo / i \ k 

= 2". (b) Evaluate I - 1 k. 


k = 0 
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36. Why can’t you set x — — 1 in formula (5) to “prove” the following? 



37. If g(x) is the generating function for a 2: , then show that g <k] {())/ k\ = a^. 

38. A probability generating function P x (t) for a discrete random variable X has 
a polynomial expansion in which p r , the coefficient of t r , is equal to the 
probability that X — r. 

(a) If X is the number of heads that occur when a fair coin is flipped n times, 
show that P x (t) = (±)"(1 + t) n . 

(b) If X is the number of heads that occur when a biased coin is flipped n times 
with probability p of heads (and q — 1 — p), show that P x (t) — (q + pt) n . 

(c) If X is the number of times a fair coin is flipped until the fifth head occurs, 
find P x (t). 

(d) Repeat part (c) until the mth head occurs and probability of a head is p. 

39. (a) The expected value E(X) of a discrete random variable X is defined to be 

Y Pr c. Show that E(X) = P x ( 1) — that is, (d /dt)P x (t), with t set equal to 1. 
(b) Find E(X) for the random variables X in Exercise 38. 

40. (a) The second moment E 2 (X) of a discrete random variable X is defined to be 

Y P, r 2 . Show that E 2 (X) = P' x { 1) + P"(l). 

(b) Find E 2 (X) for the random variables X in Exercises 38(b) and 38(c). 

41. Suppose a fair coin is flipped until the mth head occurs and suppose that no 
more than .v tails in a row occur. If X is the number of flips, find P x (t). 

42. Experiments A! and A" have probabilities p' and p" of success in each trial 
and are performed «' and n" times, respectively. Let X' and X" be the number 
of successes in the respective experiments, and let X be the total number of 
successes on both experiments. Verify the following. 

(a) P x (t) = P' x (t)P x (t) 

(b) E(X)=E(X') + E{X") 

(c) E 2 (X) = E 2 {X') + E 2 (X") + E{X'X") 

43. Suppose a red die is rolled once and then a green die is rolled as many times 
as the value on the red die. If a r is the number of ways that the (variable length) 
sequence of rolls of the green die can sum to r, show that the generating function 
for a r is /(/(x)), where fix) = (x + x 2 + x 3 + x 4 + x 5 + x 6 ). 

44. Suppose X is the random variable of the number of minutes it takes to serve a 
person at a fast-food stand. Suppose Y is the random variable of the number of 
people who line up to be served at the stand in one minute. Let Z be the number of 
people who line up while a person is being served. Show that Pz(t) = ?x( PyC)). 
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6.3 PARTITIONS 

In this section we discuss partitions and their generating functions. Unfortunately, 
there is no easy way to calculate the coefficients of most of these generating functions. 

A partition of a group of r identical objects divides the group into a collection 
of (unordered) subsets of various sizes. Analogously, we define a partition of the 
integer r to be a collection of positive integers whose sum is r. Normally we write 
this collection as a sum and list the integers of the partition in increasing order. For 
example, the seven partitions of the integer 5 are 

l+l+l+l+l 1+1+ 1+2 1+1+3 

1+2+2 1+4 2+3 5 

Note that 5 is a “trivial” partition of itself. 

Let us construct a generating function for a r , the number of partitions of the 
integer r. A partition of an integer is described by specifying how many Is, how many 
2s, and so on, are in the sum. Let denote the number of ks in a partition. Then 


lei + 2e 2 + 3e3 H \-ket H b re r — r 


Intuitively, we can think of picking r objects from an unlimited number of piles 
where the first pile contains single objects, the second pile contains objects stuck 
together in pairs, the third pile contains objects stuck together in triples, and so 
on. To model this integer-solution-to-an-equation problem with a generating func- 
tion, we need polynomial factors whose formal multiplication yields products of the 
form 


(x 2 ) 0 
(X 2 ) 1 
(X 2 ) 2 


(X 3 ) 0 

(X 3 ) 1 

(X 3 ) 2 


(x*)° 

(x*) 1 

(x k ) 2 


The generating function g(x) must be 

g(x) — (1 + x + x 2 + x 3 + • • • + x" + • • •) 

.(l+x 2 +x 4 + x 6 + ...+x 2 ” + .-.) 
■(l+x 3 +x 6 +x 9 + ---+x 3 " + ---) 


■ (1 + x k + x 2k + x 3 * + • • • + x kn + • ■ ■) 


If we set y = x 2 . then the second factor in g(x) becomes 1 + y + y 2 + - ■ • = ( 1 — y ) 
Thus 


1 + x 2 + x 4 + x 6 H — ■ + x 2 " + • ■ ■ = (1 — x 2 ) 1 


6.3 Partitions 267 


Similarly, the A.1h factor can be written as (1 — x*) -1 . For partitions up to r = m, 
we need the first in polynomial factors. But for arbitrary values of r, we need an 
infinite number of polynomial factors. Then 

1 

^ ^ (1 — x)(l — x 2 )(l — x 3 ) ■ ■ • (1 — x k ) • • ■ 

We now consider some more specialized partition problems. 

Example 1 

Find the generating function for a r , the number of ways to express r as a sum of 
distinct integers. 

We must constrain the standard partition problem not to allow any repetition 
of an integer. For example, there are three ways to write 5 as a sum of distinct 
integers 1 + 4, 2 + 3, and 5. The appropriate modification of the generating function 
for unrestricted partitions is 

g(x) = (1 +x)(l +x 2 )(l +x 3 )(l +x 4 )- • • (1 + x k )- ■ ■ m 


Example 2 

Find a generating function for a r , the number of ways that we can choose 2(f, 3 <f, and 
5<f stamps adding to a net value of rtf. 

The problem is equivalent to the number of integer solutions to 

2e 2 + 3e 2 + 5es — r 0 < e 2 , e 2 , e$ 

The appropriate generating function is 

(1 +x 2 +x 4 + x 6 H )(1 + x 3 + x 6 T x 9 H ) • (1 +x 5 +x 10 + x 15 H ) ■ 


Example 3 

Show with generating functions that every positive integer can be written as a unique 
sum of distinct powers of 2. 

The generating function for a r , the number of ways to write an integer r as a sum 
of distinct powers of 2, will be similar to the generating function for sums of distinct 
integers in Example 1 , except that now only integers that are powers of 2 are used. 
The generating function is 

g*(x) — (1 +x)(l +x 2 )(l +x 4 )(l +x 8 ) ■ ■ ■ (1 +x 2 ‘) ■ • ■ 

To show that every integer can be written as a unique sum of distinct powers of 
2, we must show that the coefficient of every power of x in g*(x) is 1. That is, show 
that 

1 

1 — x 


g*(x) — 1 + x + x 2 + x 3 H = 
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or equivalently 


(l-x)g*(x)=l 

We prove this identity by repeatedly using the factorization (1 — x k )(\ +x k ) = 
l-x 2k . 

(1 - x)g*(x) = (1 - x)(l + x)(l + X 2 ) (1 + x 4 )(l +x 8 ) ■■■ 

= [ (1-X 2 ) ](l+.r 2 )(l+x 4 )(l+x 8 ) ••• 

= [ (1-x 4 ) ] (l+x 4 )(l+x 8 ) 

= 1 

By successively making the replacement (1 — x*)(l + x k ) = 1 —x 2k , we even- 
tually eliminate all factors in (1 —x)g*(x). Formally, the coefficient of any specific 

x k in (1 — x)g*(x) must be 0. So (1 — x)g*(x) — 1 and g*(x) = 1 +x +x 2 H , as 

required. ■ 


A convenient tool for studying partitions is a diagram known as Ferrers diagram. 
A Ferrers diagram displays a partition of r dots in a set of rows listed in order 
of decreasing size. The partition 1+2 + 2 + 3 + 7 of 15 is shown in the Ferrers 
diagram in Figure 6.1a. If we transpose the rows and columns of a Ferrers diagram 
of a partition of r, we get a Ferrers diagram of another partition of r. This diagram 
is called the conjugate of the original Ferrers diagram. For example. Figure 6.1 b 
shows the conjugate of the Ferrers diagram in Figure 6.1a. This new Ferrers diagram 
represents the partition of 15, 1 + 1 + 1 + 1+2 + 4 + 5. Clearly, transposing is 
unique: two Ferrers diagrams have equal conjugates if and only if they are equal. 


Example 4 

Show that the number of partitions of an integer r as a sum of m positive integers 
is equal to the number of pardons of r as a sum of positive integers, the largest of 
which is m. 


(b) 


Figure 6.1 


(a) 
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If we draw a Ferrers diagram of a partition of r into m parts, then the Ferrers 
diagram will have m rows. The transposition of such a diagram will have m columns, 
that is, the largest row will have m dots. Thus there is a one-to-one correspondence 
between these two classes of partitions. ■ 


6.3 EXERCISES 



' Summary of Exercises Exercises 1-10 involve generating func- 
tion models for partitions. The next seven exercises use Ferrers diagrams to 
verify partition identities. The next five exercises are more difficult partition 
problems. 

1. List all partitions of the integer: (a) 4, (b) 6. 

2. Find a generating function for a r , the number of partitions of r into 

(a) Even integers (b) Distinct odd integers 

3. Find a generating function for the number of ways to write the integer rasa sum 
of positive integers in which no integer appears more than three times. 

4. Find a generating function for the number of integer solutions of 2x + 3y + 
7 z = r with 

(a) x, y, z > 0 (b) 0 < z < 2 < y < 8 < x 

5. Find a generating function for the number of ways to make r cents’ change in 
pennies, nickels, dimes, and quarters. 

6. Find a generating function for the number of ways to distribute r identical objects 
into 

(a) Three indistinguishable boxes 

(b) n indistinguishable boxes (n < r ) 

7. (a) Show that the number of partitions of 10 into distinct parts (integers) is equal 

to the number of partitions of 10 into odd parts by listing all partitions of 
these two types. 

(b) Show algebraically that the generating function for partitions of r into distinct 
parts equals the generating function for partitions of r into odd parts, and 
hence the numbers of these two types of partitions are equal. 

8. Show with generating functions that every positive integer has a unique decimal 
representation. 

9. (a) Prove the result in Example 3 by induction. 

(b) Prove the result of Example 3 directly by recursively substituting ( 1 + x k ) = 


(1 — x 2k )/(\ — x k ) in g*(x). 


10. The equation in Example 3, g*(x)( 1 — x) = 1 , can be rewritten 1 — x = 1 /g*(x). 
Use this latter equation to prove that among all partitions of an integer r,r> 2, 
into powers of 2, there are as many such partitions with an odd number of parts 
as with an even number of parts. 
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11. Let R(r, k) denote the number of partitions of the integer r into k parts. 

(a) Show that R(r, k) = R(r — 1, k — 1 ) + R(r — k, k). 

(b) Show that Y^k=\ n — r, k) — R(n, r). 

12. Use a Ferrers diagram to show that the number of partitions of an integer into 
parts of even size is equal to the number of partitions into parts such that each 
part occurs an even number of times. 

13. Interpret the integer multiplication mn, “ m times n,” to be the sum of m ns. Prove 
that mn = nm. 

14. Show that the number of partitions of the integer n into three parts equals the 
number of partitions of 2 n into three parts of size < n. 

15. Show that the number of partitions of n is equal to the number of partitions of 2 n 
into n parts. 

16. Show that any number of partitions of 2r + k into r + k parts is the same for 
any k. 

17. Show that the number of partitions of r + k into k parts is equal to 

(a) The number of partitions of r + ( /l31 ) into k distinct parts 

(b) The number of partitions of r into parts of size < k 

18. Show that the number of ordered partitions of n is 2' !_1 . For example, the ordered 
partitions of 4 are: 4, 1+3, 2 + 2, 3+1, 1 + 1 + 2, 1 + 2+ 1, 2+1 + 1, 1 + 1 + 
1 + 1. (Flint: Write n Is in a row and determine all the ways to partition this 
sequence into clusters of Is). 

19. (a) Find a generating function for a n , the number of partitions that add up to at 

most n. 

(b) Find a generating function for a n , the number of partitions of n into three 
parts in which no part is larger than the sum of the other two. 

(c) Find a generating function for a n , the number of different (incongruent) 
triangles with integral sides and perimeter n. 

20. Show that 2(1 — x) _3 [(l — x) -3 + (1 + x) -3 ] is the generating function for the 
number of ways to toss r identical dice and obtain an even sum. 

21. (a) A partition of an integer r is self-conjugate if the Ferrers diagram of the 

partition is equal to its own transpose. Find a one-to-one correspondence 
between the self-conjugate partitions of r and the partitions of r into distinct 
odd parts. 

(b) The largest square of dots in the upper left-hand corner of a Ferrers diagram 
is called the Durfee square of the Ferrers diagram. Find a generating 
function for the number of self-conjugate partitions of r whose Durfee 
square is size k (ak x k array of dots). (Flint: Use a 1 — 1 correspondence 
between these and the partitions of r — k 2 into even parts of size at 
most 2k.) 
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(c) Show that 

(l+x)(l+x 3 )(l+x 5 )... 

OO k 2 

■“ (1 — x 2 )(l — x 4 )(l — X 6 ) ... (1 — x 2k ) 

22 . Let {” } denote the number of partitions of n distinct objects into k nonempty 
subsets. Show that J"^ 1 } = /.{J, 1 } + 

23 . Write a computer program to determine the number of all partitions of an inte- 
ger r 

(a) Into k parts 

(b) Into any number of parts 

(c) Into distinct parts 



6.4 EXPONENTIAL GENERATING FUNCTIONS 

In this section we discuss exponential generating functions. They are used to model 
and solve problems involving arrangements with repetition. The generating functions 
used in the previous sections to model selection with repetition problems are called 
ordinary generating functions. Exponential generating functions involve a more 
complicated modeling process than do ordinary generating functions. Consider the 
problem of finding the number of different words (arrangements) of four letters when 
the letters are chosen from an unlimited supply of as, b s and cs, and the word must 
contain at least two as. The possible selections of four letters to form the word 
are {a, a, a, a}, {a, a, a, b }, [a, a, a, c}, {a, a, b, b }, {a, a, b, c}, and {a, a, c, c}. The 
number of arrangements possible with each of these six selections is 

4! 4! 4! 4! 4! 4! 

4!0!0! 3! 1 !0! 3!0!1! 2!2!0! 2! 1 ! 1 ! 2!0!2! 

respectively. So the total number of words will be the sum of these six terms. 

The selections of letters used in such a word range over all sets of four letters 
chosen with repetition from as, bs, and cs with at least two as. Equivalently, the 
number of such selections equals the number of integer solutions to the equation 

ei+e 2 + e3=4 2<ei, 0 < C 2 , e^ (1) 

At first sight, the four-letter words problem seems similar to previous problems 
that could be modeled by (ordinary) generating functions. However, in this case 
we do not want each integer solution of ( 1 ) to contribute 1 to the count of the num- 
ber of possible words. Instead it must contribute 4! /(ei !e 2 !« 3 !) words. In terms of 
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generating functions, the coefficient of x 4 will count all formal products with associ- 
ated coefficients 


(ei + e 2 + e 3 )l 

ei 1^2 ! 


2 < e\, 0 < e 2 , e 3 


whose exponents sum to 4 (a much harder problem). Fortunately, exponential gener- 
ating functions yield formal products of exactly this form. 

An exponential generating function g(x) for a, , the number of arrangements 
with r objects, is a function with the power series expansion 

2 3 x r 

g(x) — ao + a\x + «2~ +03 — H — ■ + a r — H — ■ 

2! 3! r\ 

We build exponential generating functions in the same way that we build ordinary 
generating functions: one polynomial factor for each type of object; each factor has a 
collection of powers of x that are an inventory of the choices for the number of objects 
of that type. However, now each power x r is divided by r\. 

As an example, let us consider the four-letter word problem with at least two as. 
We claim that the exponential generating function for the number of r-letter words 
formed from an unlimited number of as, b s, and cs containing at least two as is 


g(x) = 



l+x + - h 

2! 




( 2 ) 


The coefficient of x r in (2) will be the sum of all products (x e ' / e \ !) (x n je 2 l) 
(jc e 3 /e 3 !), where e\ + e 2 + e 3 — r, 2 < e\, 0 < e 2 , 63 . If we divide x r by r! and com- 
pensate by multiplying its coefficient by r ! , then the x r term in g(x) becomes 


E 

ei+e 2 +e 3 =r 


rl ) 

e\ !e 2 !e 3 ! J 


x 

"h 


2 < e\, 0 < e 2 , 63 


This coefficient of x r /r \ is just what we wanted. So the exponential generating func- 
tion for the number of such / -letter words is indeed the expression (2). What makes 
exponential generating functions work is the “trick” of dividing x r by rl and multi- 
plying the coefficient of x r by rl . 

Before we show how to calculate coefficients of an exponential generating func- 
tion, let us give a few other examples of exponential generating function models. 


Example 1 

Find the exponential generating function for a r , the number of r arrangements without 
repetition of n objects. 

We know that the answer is P(n, r). Since there is no repetition, the exponential 
generating function is (1 +x ) n — our old binomial friend! The coefficient of x 1 in 
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(1 + x )" is ("). However, now we want the coefficient of ^ in (1 + x)": 

[n\ n\ n\ x r 

I lx = x = 

\r J (n — r)\r\ (n — r)! r! 

Thus a r = n\/(n — r)\ = P(n, r), as expected. ■ 


Example 2 

Find the exponential generating function for a r , the number of different arrangements 
of r objects chosen from four different types of objects with each type of object 
appearing at least two and no more than five times. 

The number of ways to arrange the r objects with e, objects of the i th type is 
r ! /e i ! e 2 ! ^ 3 ! ^ 4 ! To sum up all such terms with the given constraints, we want the 
coefficient of x r /r ! in (x 2 /2! + x 3 /3! + x 4 /4! + x s /5!) 4 . ■ 


Example 3 

Find the exponential generating function for the number of ways to place r (distinct) 
people into three different rooms with at least one person in each room. Repeat with 
an even number of people in each room. 

Recall that distributions of distinct objects are equivalent to arrangements with 
repetition; see Section 5.4. The required exponential generating functions are 



There are few identities or expansions to use in conjunction with exponential 
generating functions. As a result, there are only a limited number of exponential 
generating functions whose coefficients can be easily evaluated. The fundamental 
expansion for exponential generating functions is 


2 3 

v . X X 

e = 1 + x H 1 F ■ 

2! 3! 


x 

■ + — + • 
r! 


Replacing x by nx in (3), we obtain 


= 1 


2 2 
n x 


3 3 
n x 


- nx 


r\ 


(3) 


(4) 


2! 3! 

The power series in (3) is the Taylor series for e x , which is derived in all calculus texts. 
This expansion and the companion expansion for e nx are valid for all values of x. There 
is no way to factor out a common power of x in exponential generating functions, 
since the power of x must be matched with r\ in the denominator. The best that can 
be done is to subtract the missing (lowest) powers of x from e x . For example, the 
exponential factor representing two or more of a certain type of objects can be written 


x 

2 ! 


x 

3i 


x 

4\ 


1 
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Two useful expansions derived from (3) are 


1 

2 

1 

2 


(e x 


x 2 x A x 6 
2! + 4! + 6! + "' 

(5) 

3 5 7 

X X X 

3!" + 5!" + 7!" + " ' 

(6) 


Let us work some examples using (3) to (6). 


Example 4 

Find the number of different r arrangements of objects chosen from unlimited supplies 
of n types of objects. 

In Chapter 5 we would have solved this problem by arguing that there are n 
choices for the type of object in each of the r positions, for a total of n r different 
arrangements. Now let us solve this problem with exponential generating functions. 
The exponential generating function for this problem is 

(i + ,44 + ...)" = <OW' 

By (4), the coefficient of x r /r\ in this generating function is n r . ■ 


Example 5 

Find the number of ways to place 25 people into three rooms with at least one person 
in each room. 

In Example 3, we found the exponential generating function for this problem 



To find the coefficient of x r / r\ in (e x — l) 3 , we first must expand this binomial 
expression in e x 

(e x - l) 3 = e 3x - 3e lx + 3e x - 1 


From (4), we get 


e 3x - 3e 2x + 3e x 



1 


So the coefficient of x 25 /25! is 3 25 — (3 x 2 25 ) + 3. ■ 


Example 6 

Find the number of r-digit quaternary sequences (whose digits are 0, 1,2, and 3) with 
an even number of Os and an odd number of Is. 
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The exponential generating function for this problem is 




2 


Using identities (5) and (6), we can write this expression as 


1 

2 


(e x + e~ x ) x i(e v - e~ x )e x e x = ^(e 2x -e° + e°- e~ 2x )e x e x 

= Ue 2 x -e- 2x )e 2x = -y x -\)=\ 

4 4 4 



Then for r > 0, the coefficient of x r / r ! is |4' = 4' * 1 2 3 4 5 6 7 . The simple form of this answer 
suggests that there should be some combinatorial argument for obtaining this short 
answer directly. ■ 


6.4 EXERCISES 


' Summary of Exercises Most of the first 15 exercises are similar to 
the examples in this section. Exercises 21 and 22 are a continuation of the probability 
generating function exercises in Section 6.2. 

1. Find the exponential generating function for the number of arrangements or r 
objects chosen from five different types with at most five of each type. 

2. Find the exponential generating function for the number of ways to distribute r 
people into six different rooms with between two and four in each room. 

3. Find an exponential generating function for a r , the number of r- letter words with 
no vowel used more than once (consonants can be repeated). 

4. (a) Find the exponential generating function for s nr , the number of ways to 

distribute r distinct objects into n distinct boxes with no empty box. Consider 
n a fixed constant. 

(b) Determine s nr . The number s nj /n\ is called a Stirling number of the second 
kind. 

5. Find the exponential generating function, and identify the appropriate coefficient, 
for the number of ways to deal a sequence of 13 cards (from a standard 52-card 
deck) if the suits are ignored and only the values of the cards are noted. 

6 . How many ways are there to distribute eight different toys among four children 
if the first child gets at least two toys? 

7. How many r-digit ternary sequences are there with 

(a) An even number of Os? 

(b) An even number of Os and even number of 1 s? 

(c) At least one 0 and at least one 1 ? 
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8 . 

9 . 

10 . 


11 . 

12 . 

13 . 


14 . 

15 . 

16 . 
17 . 


18 . 

19 . 

20 . 


How many ways are there to make an /‘-arrangement of pennies, nickels, dimes, 
and quarters with at least one penny and an odd number of quarters? (Coins of 
the same denomination are identical.) 

How many 10-letter words are there in which each of the letters e,n, r, s occur 
(a) At most once? (b) At least once? 


How many /'-digit ternary sequences are there in which 

(a) No digit occurs exactly twice? 

(b) 0 and 1 each appear a positive even number of times? 


How many /'-digit quaternary sequences are there in which the total number of 
Os and 1 s is even? 

Find the exponential generating function for the number of ways to distribute r 
distinct objects into five different boxes when b\ < bi < 4, where h\ , /g are the 
numbers of objects in boxes 1 and 2, respectively. 

(a) Find the exponential generating functions for p r , the probability that the first 
two boxes each have at least one object when r distinct objects are randomly 
distributed into n distinct boxes. 


(b) Determine p r . 

Find an exponential generating function for the number of distributions of r 
distinct objects into n different boxes with exactly m nonempty boxes. 

Find an exponential generating function with 
(a) a r = l/(r + 1) (b) a r = r! 


Show that if g(x) is the exponential generating function for a,-, then g®(0) = eg, 
where g (/c) (x) is the Arth derivative of g(/c). 


If 


and 


then show that 


f( X ) = Y J a r~ 7 £(*)= 

r—0 r—0 

oo r 

h{x) = f(x)g(x) = ^2 c r - - 


r= 0 



Show that e x e y = e x+y by formally multiplying the expansions of e x and e y 
together. 

Find a combinatorial argument to show why the answer in Example 6 is 4 , _I . 
Show that e x /(l — x) n is the exponential generating function for the number of 
ways to choose some subset (possibly empty) of r distinct objects and distribute 
them into n different boxes with the order in each box counted. 
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21 . A Poisson random variableXhas p r = Prob(x = r) = £e '' . Find the probability 
generating function forX (see Exercise 34 in Section 6.2). 

22 . Let Pix) — YZo Pk xk t> e ^ le probability generating function for the discrete 
random variable X — that is, p k is the probability that X — k. 

(a) Show that the exponential generating function for m k , the At !r moment of 
X, m k = j k Pj is P(e x ). 

(b) The Ath factorial moment of X , m j; , is defined to be equal to 
Y^jLk ./ • /U — &)! Pj- Show that the exponential generating function for m* k 
is P(x + 1). 

(c) If X is the number of heads when n coins are flipped, find m i , m 2 , m*. (Hint: 
Use Exercise 16.) 


(d) If X is Poisson (see Exercise 21), find m.\ and ml. 



6.5 A SUMMATION METHOD 

In this section we show how to construct an ordinary generating function h(x) whose 
coefficient of x r is some specified function p(r) of r, such as r 1 or C(r, 3). Then 
we use h(x) to calculate the sums /?(()) + p( 1) H — • + p(n), for each positive n. The 
following four simple rules for constructing a new generating function from an old one 
will be used repeatedly in this section. Assume that A(x) — ^ a n x n , li (x ) = ^ b n x„, 
and C(x) — c„x n . 

Rule 1. If b„ = da„, then B(x) — dA(x), for any constant d. 

Rule 2. If c n = a„ + b n , then C(x) = A(x) + B(x). 

Rule 3. If c„ = Yl'i=o a 'bn-i , then C(x) = A(x)B(x). 

Rule 4. If b n = a„-k, except b, = 0 for i < k , then B(x) = x k A(x). 

Rule (3) is simply expression (6) from Section 6.2. The other rules are immediate. 

The other basic operation for our coefficient construction is to multiply each 
coefficient a r in a generating function g(x) by r. We claim that the new generating 
function g*(jc) with a* = ra r is obtained by differentiating g(x) and then multiplying 
by x— that is, g*(x) = x (£g(x)). If 


g(x) = ao + a\x + a^x 2 + a$x 3 H — • + a r x‘ + • • • 


( 1 ) 


then differentiation of g(x) yields 

— g(x) = a\ + 2 a~>x + 3 a^x 2 + • — b ra,x'~ l + • • • (2) 

dx 
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and now multiplying by x (Rule 4) gives 


g *(x) = x 



— a\x + 2 a 2 X 2 + 3fl3X 3 + • • • + ra r x r + ■ ■ ■ 


(3) 


Note that the a f) term in g(x) disappears in (2) because Oao = 0. 

Combining this operation with Rules (1) and (2), we can repeatedly multiply 
the coefficient of x r by r or a constant and can add such coefficients together. This 
permits us to form polynomials in r. 

The natural question now is: To what coefficients do we apply these operations? 
The natural answer is: When in doubt start with the unit coefficients a r — 1 of the 
generating function 


— l+x + x“ + x 3 + •••+x , + ••• 

1 — x 


Example 1 

Build a generating function h{x) with a,- — 2 r 2 . 

Starting with 1/(1 — x), we multiply its coefficients first by r using the generating 
function operations shown in Eqs. (2) and (3) to obtain 


1 


dx 1 — x 


1 


(1-x) 2 


= lx + 2x 2 + 3x 3 H b rx r H 


Now we repeat these operations on x/(l — x) 2 to obtain 
d x 


*(1 + x ) = i2 x + 2 2 x 2 + 3 2 x 3 f- r V + . 


K dx ( 1— x) 2 / (1— x) 3 

Finally we multiply by 2 to obtain the required generating function 


h(x) = 2x{l+x ^ = (2 x l 2 )x + (2 x 2 2 )x 2 + (2 x 3 2 )x 3 
(1 -x) 3 

H b (2 x r 2 )x' H ■ 


Example 2 

Build a generating function h(x) with a r — (r + 1 )r(r — 1). 

We could multiply (r + I )r(r — 1) out getting a r = r 3 —r, obtain generating 
functions for r 3 and r as an Example 1, and then subtract one generating function 
from the other. It is easier, however, to start with 3!(1 — x) 4 , whose coefficient a r 
equals 


a r = 3! 


r + 4- 1 


= 3! 


(r + 3)! (r + 3)! 


= (r + 3)(r + 2)(r + 1) 


r 


r!3! 
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Then the power series expansion of 3!(1 — x) 4 is 



+ ••• + (/ + 3)(r + 2)(r + l)x r + • • • 


(4) 


Compare (4) with the desired generating function 

h(x) = (3 x 2 x l)x 2 + (4 x 3 x 2)x 3 + (5 x 4 x 3)x 4 
H h (r + l)r(r - l)x r 4 

The generating function h(x) that we seek is just the series in (4) multiplied by x 2 . 
So h(x) — 3!x 2 (l — x)~ 4 . ■ 

Generalizing the construction in Example 2, we see that (n — 1)!(1 — x)~" has a 
coefficient 

a r = (n — l)!C(r + n — 1, r ) = [r + (n - l)][r + (n -2)] • • • (r + 1) 

Any coefficient involving a product of decreasing terms, such as (r + 1 )r(r — 1), can 
be built from (n — 1)!(1 — x)~" as in Example 2, where n — 1 is the number of terms 
in the product. 

Thus far the construction of an h(x) with specified coefficients has been just 
an exercise in algebraic manipulations of generating functions. The following easily 
verified theorem gives some purpose to this exercise. 

Theorem 

If li(x) is a generating function where a,, is the coefficient of x r , then IT(x) = 
/?(x)/(l — x) is a generating function of the sums of the a r s. That is, 



h*{x) — ao + (ao + ai)x + («o + ai + a{)x 2 H V I ^ 


This theorem follows from Rule (3) for the coefficients of the product h*(x ) = 
f(x)h(x), where f(x) = 1/(1 — x). Now we return to the previous examples. 

Example 1: (continued) 

Evaluate the sum 2x 1 2 + 2x 2 2 + 2 x 3 2 H b 2 « 2 . 

The generating function h(x) for a r =2r 2 was found in Example 1 to be 
2x(l + x)/(l — x) 3 . Then by the theorem, the desired sum a\ + an H — ■ + a n is the 
coefficient of x" in 


h*(x) — h(x)/( 1 — x) = 2x(l +x)/(l — x) 4 
= 2x(l — x) -4 + 2x 2 (1 — x) -4 
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The coefficient of x" in 2xi 1 — x)“ 4 is the coefficient of x n ~ 1 in 2(1 — x) -4 , and the 
coefficient of x” in 2x 2 (l — x ) 4 is the coefficient of x" -2 in 2(1 — x) 4 . Thus, the 
given sum equals 


/(«— l) + 4— 1\ (in- 2) + 4— 1\ 

V (n-1) / V (n-2) y 



Example 2: (continued) 

Evaluate the sum 3x2x 1+4 x3x 2+- — f- (« + l)n(n — 1). 

The generating function h(x) for a r = (r + 1 )r(r — 1) was found in Example 2 
to be h (x ) = 6x 2 ( I — x ) 4 . By the theorem the desired sum is the coefficient of 
x” in h*(jx) = h(x)/(\ — x) = 6x 2 (l — x) . This coefficient is the x" -2 term in 
6(1 — x)” 5 — namely, 6 C((n — 2) + 5— 1, « — 2) — 6 C(n + 2, 4). ■ 


Note that these two summation problems can also be solved with the summation 
method in Section 5.5 based on binomial identities. For comparison, see the analysis 
of the sum in Example 2 given in Example 4 of Section 5.5 [note that the general 
term of this sum in Section 5.5 is (n — 2 )(« — 1 )n instead of (n + 1 )n(n — 1)]. Both 
methods have their advantages. 



2 . 


6.5 EXERCISES 


Find ordinary generating functions whose coefficient a r equals 

(a) r (b) 13 (c) 3r 2 (d)3r + 7 (e) r(r - l)(r - 2)(r - 3). 

Evaluate the following sums (using generating functions): 

(a) 0+1 + 2 + - — \-n 

(b) 13 + 13 4 1- 13 

(c) 0+3+ 12 H b3« 2 

(d) 7 + 10 + 13 4 f (3n + 7) 

(e) 4x3x2xl+5x4x3x2 + - — h n(n — 1)(« — 2)(n — 3) 


3. Find a generating function with a r = r(r + 2) (do not add together generating 
functions for r 1 and for 2r). 

4. (a) Show how r 2 and r ’ can be written as linear combinations of P(r, 3), Pir. 2), 

and P(r, 1). 

(b) Use part (a) to find a generating function for 3r 3 — 5r 2 + 4r. 

5. Find a generating function for 

(a) a r = (r - l) 2 (b)a r = l/r 

6. If h (x ) is the ordinary generating function for a r , what is the coefficient of x' in 
h(x)( 1 — x) (give your answer in terms of the a r s)? 

7. Verify Theorem 1 in this section. 
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8. If h(x) is the ordinary generating function for a r , find the generating function for 


s r = YlkLr+i clk ’ assuming all s r s are finite and a r — ► 0 as r —> oo. 



6.6 SUMMARY AND REFERENCES 



Generating functions are at once a simple-minded and a sophisticated mathematical 
model for counting problems; simple-minded because polynomial multiplication is a 
familiar, seemingly well understood part of high school algebra, and sophisticated be- 
cause with standard algebraic manipulations on generating functions, one can solve 
complicated counting problems. These algebraic manipulations automatically per- 
form the correct combinatorial reasoning for us! Note that generating functions are 
an elementary example of the algebraic approach that pervades the research frontiers 
of contemporary mathematics. Letting algebraic expressions, whether they model 
combinatorial, geometric, or functional information, do the work for us is what much 
of modern mathematics is all about. 

In this chapter, generating functions were used to model selection and arrange- 
ment problems with constrained repetition. Partition problems were also modeled (but 
were not solved). Finally, we showed how to construct generating functions whose 
coefficient for x r was a given function of r and used these generating functions to 
evaluate related sums. In the next three chapters, generating functions will be used to 
model and solve other combinatorial problems. Exercise 38 of Section 6.2 introduces 
one type of generating function used in probability theory (for more, see Feller [2]). 
Laplace and Fourier transforms in analysis are also generating functions [the Fourier 
transform of a function f(t) can be viewed as a generating function for the Fourier 
coefficients of /(/)]. 

The first use of combinatorial generating functions was by DeMoivre around 
1720. He used them to derive a formula for Fibonacci numbers (this derivation is 
given in Section 7.5). In 1748, Euler used generating functions in his work on par- 
tition problems. The theory of combinatorial generating functions, developed in the 
late eighteenth century, was primarily motivated by parallel work on probability gen- 
erating functions (see Exercises 38 to 42 in Section 6.2 and Exercises 21 and 22 in 
Section 6.4). Laplace made many contributions to both theories and presented the first 
complete treatment of both in his 1812 classic Theorie Analytique des Probability. 

For a full discussion of the use of generating functions in combinatorial math- 
ematics, see MacMahon [3]. For a nice presentation of partition problems, also see 
Cameron [ 1 ] . 

1. P. Cameron, Combinatorics: Topics, Techniques, Algorithms, Cambridge Univer- 
sity Press, Cambridge, 1994. 

2 . W. Feller, An Introduction to Probability Theory and Its Applications, vol. I, 2nd 
ed., John Wiley & Sons, New York, 1957. 

3 . P. MacMahon, Combinatory Analysis, vols. I and II (1915), reprinted in one vol- 
ume, Chelsea Publishing, New York, 1960. 
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RECURRENCE 

RELATIONS 



RECURRENCE RELATION MODELS 


In this chapter we show how a variety of counting problems can be modeled with 
recurrence relations. We then discuss methods of solving several common types of 
recurrence relations. 

A recurrence relation is a recursive formula that counts the number of ways to 
do a procedure involving n objects in terms of the number of ways to do it with fewer 
objects. That is, if ak is the number of ways to do the procedure with k objects, for 
k — 0 , 1 , 2 ,..., then a recurrence relation is an equation that expresses a„ as some 
function of preceding a 2 s, k < n. The simplest recurrence relation is an equation 
such as a n = 2a„-\ . The following equations display some of the forms of recurrence 
relations that we will build to model counting problems in the chapter: 

a„ = + c 2 a „_ 2 H — • + c r a„- r where c,- are constants 

a„ = ca n - 1 + f(n) where f(n) is some function of n 
a„ = aoa n -i + a\a n -2 + • • • + a n -\ao 
®n,m — Un- I ,m T @n — l,m—l 

Just as mathematical induction is a proof technique that verifies a formula or 
assertion by inductively checking its validity for increasing values of n, so a recurrence 
relation is a counting technique that solves an enumeration problem by recursively 
computing the answer for successively larger values of n. 

The observant reader will remember that mathematical induction also involves 
an initial step of verifying the formula or assertion for some starting (smallest) value 
of n. The same is true for a recurrence relation. We cannot recursively compute the 
next a„ unless some initial values are given. If the right-hand side of a recurrence 
relation involves the r preceding a^s, then we need to be given the first r values, 
ao, ai, , a r - 1 . For example, in the relation a„ = a ( ,_| + a„_ 2 , knowing only that 
«o = 2 is insufficient. If given also that = 3, then we use the relation to obtain 
a 2 = «i + ao = 3 + 2 = 5; <23 = + fli = 5 + 3 = 8 ; CI 4 = 8 + 5 = 13; and soon. The 

information about starting values needed to compute with a recurrence relation is 
called the initial conditions. 
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If we can devise a recurrence relation to model a counting problem we are 
studying and also determine the initial conditions, then it is usually possible to solve 
the problem for moderate sizes of n, such as n — 20, quickly by recursively computing 
successive values of a„ up to the desired n. For larger values of n, a programmable 
calculator or computer is needed. 

For many common types of recurrence relations, there are explicit formulas for 
a„. Sections 1 .2-1 A discuss some of these solutions. However, it is frequently easier 
to determine recursively the value of ao,ai,... up to the desired value, say, an than 
to compute an from a complicated general formula for a n . 

Example 1: Arrangements 

Find a recurrence relation for the number of ways to arrange n distinct objects in a 
row. Find the number of arrangements of eight objects. 

Let a„ denote the number of arrangements of n distinct objects. There are n choices 
for the first object in the row. This choice can be followed by any arrangement of 
the remaining n — 1 objects; that is, by the a„_ arrangements of the remaining n — 1 
objects. Thus a„ = n«„_i . Substituting recursively in this relation, we see that 

a„ — na„-\ = n[{n — l)a„_ 2 ] = • • • = n(n — 1 ){n — 2) ■ • • x 2 x 1 =n\ 

In particular, as = 8! ■ 

Of course, we already know that the number of arrangements of n objects is n\ 
It is often useful to try out a new technique on an old problem to see how it works 
before using it on new problems. Now for some new problems. 


Example 2: Climbing Stairs 

An elf has a staircase of n stairs to climb. Each step it takes can cover either one stair 
or two stairs. Find a recurrence relation for a n , the number of different ways for the 
elf to ascend the n-stair staircase. 

It is easy to check with a figure that ai — 1, a 2 — 2 (two one-steps or one two- 
steps) and a 3 = 3. For n — 4, Figure 7.1a depicts one way of climbing the stairs, 
taking successive steps of sizes 1,2, 1. Other possibilities are one two-stair step either 
preceded by or followed by or in between two one-stair steps, or two two-stair steps, 
or four one-stair steps. In all, we count five ways to climb the four stairs; so <24 = 5. 



Figure 7.1 
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Is there some systematic way to enumerate the ways to climb four stairs that 
breaks the problem into parts involving the ways to climb three or fewer stairs? 
Clearly, once the first step is taken there are three or fewer stairs remaining to climb. 
Thus we see that after a first step of one stair, there are <73 ways to continue the climb 
up the remaining three stairs. If the first step covers two stairs, then there are 02 ways 
to continue up the remaining two stairs. So <34 = a 3 + a 2 - We confirm that the values 
for fl 4 , fl 3 , 02 satisfy this relation: 5 = 3 + 2. This argument applies to the first step 
when climbing any number of stairs, as is shown in Figures 7.1 b and 7.1c. Thus 
a n — a n — 1 + a n — 2 - ■ 

In Section 7.3 we obtain an explicit solution to this recurrence relation. The rela- 
tion a n = a n - 1 + a „_2 is called the Fibonacci relation. The numbers a„ generated by 
the Fibonacci relation with the initial conditions ao — «i = 1 are called the Fibonacci 
numbers. They begin 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89. Fibonacci numbers arise natu- 
rally in many areas of combinatorial mathematics. There is even a journal, Fibonacci 
Quarterly, devoted solely to research involving the Fibonacci relation and Fibonacci 
numbers. Fibonacci numbers have been applied to other fields of mathematics, such as 
numerical analysis. They occur in the natural world — for example, the arrangements 
of petals in some flowers. For more information about the occurrences of Fibonacci 
numbers in nature, see [ 1 ], 

Example 3: Dividing the Plane 

Suppose we draw n straight lines on a piece of paper so that every pair of lines intersect 
(but no three lines intersect at a common point). Into how many regions do these n 
lines divide the plane? 

Again we approach the problem initially by examining the situation for small 
values of n. With one line, the paper is divided into two regions. With two lines, 
we get four regions — that is, a^ = 4. See Figure 1 2a. From Figure 7.2/;, we see that 
Ci 3 = 7. The skeptical reader may ask: how do we know that three intersecting lines 
will always create seven regions? Let us go back one step, then. 

Clearly two intersecting lines will always yield four regions, as shown in Figure 
7.2 a. Now let us examine the effect of drawing the third line (labeled “3” in Figure 
7.2 b). It must cross each of the other two lines (at different points). Before, between, 
and after these two intersection points, the third line cuts through three of the regions 
formed by the first two lines (this action of the third line does not depend on how it is 
drawn, just that it intersects the other two lines). So in severing three regions, the third 
line must form three new regions, actually creating six new regions out of three old 
regions. Thus <33 = a 2 + 3 = 4 + 3 = 7, independently of how the third line is drawn. 
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Similarly, the fourth line severs four regions before, between, and after its three 
intersection points with the first three lines (see Figure 7.2 c), so that <74 = a$ + 4 = 
7 + 4 = 11. In general, the nth line must sever n regions before, between, and after 
its n — 1 intersection points with the first n — I lines. So a n = a n -\ +n. u 


Example 4: Tower of Hanoi 

The Tower of Hanoi is a game consisting of n circular rings of varying size and three 
pegs on which the rings fit. Initially all the rings are placed on the first (leftmost) peg 
with the largest ring at the bottom covered by successively smaller rings. See Figure 
7.3 a. By transferring the rings among the pegs, one seeks to achieve a similarly 
tapered pile on the third (rightmost) peg. The complication is that each time a ring is 
transferred to a new peg, the transferred ring must be smaller than any of the rings 
already piled on this new peg; equivalently, at every stage in the game there must be 
a tapered pile (or no pile) on each peg. 

Find a recurrence relation for a n , the minimum number of moves required to play 
the Tower of Hanoi with n rings. How many moves are needed to play the six-ring 
game? Try playing this game with a dime, penny, nickel, and quarter (four rings) 
before reading our solution. 

The key observation is that if, say, the six smallest rings are on peg A and we 
want to move them to peg C, we must first “play the five-ring Tower-of-Hanoi game” 
to get the five smallest rings from peg A to peg B, then move the sixth smallest ring 
from peg A to peg C, and then again “play the five-ring game” from peg B to peg C. 
See Figure 7.3. (Of course, to move the five smallest rings we must move the fifth 
smallest from A to B, which means playing four-ring games from A to C and from C 
to B, etc.) Thus to move the n rings from A to C, the n — 1 smallest rings must first 
be moved from A to B, then the largest (nth) ring moved from A to C, and then the 
n — 1 smallest rings moved from B to C. 

If a n is the number of moves needed to transfer a tapered pile of n rings from one 
peg to another peg, then the previous sentence yields the following recurrence relation: 
a„ — a „- 1 + 1 + a„-i = 2a n -\ + 1. The initial condition is a\ = 1, and so 02 = 2a \ + 
1 = 3; (73 = 2a 2 + 1 = 7; <74 = 2*73 + 1 = 15; 05 = 2a\ +1 = 31; and ag = 2a$ + 1 = 
63. So the six-ring game requires 63 moves. Note that the a n s thus far fit the formula 
a„ = 2" — 1 . ■ 




Figure 7.3 Tower of Hanoi 


7.1 Recurrence Relation Models 287 


Example 5: Money Growing in a Savings Account 

A bank pays 4 percent interest each year on money in savings accounts. Find re- 
currence relations for the amounts of money a gnome would have after n years if it 
follows the investment strategies of 

(a) Investing $1,000 and leaving it in the bank for n years 

(b) Investing $100 at the end of each year 

If an account has x dollars at the start of a year, then at the end of the year (i.e., 
start of the next year) it will have x dollars plus the interest on the x dollars, provided 
no money was added or removed during the year. Then for part (a), the recurrence 
relation is a„ = a„_i + .04a,j_i = 1.04o n _i. The initial condition is ao = 1000. For 
part (b), the relation must reflect the $100 added (which earns no interest since it 
comes at the end of the year). So a n = \ .04a fI . i + 100, with ao = 0. ■ 

A wide range of common financial activities can be modeled by recurrence 
relations, including college saving plans and mortgages. A mortgage is similar to part 
(b) of Example 5, except now the value of the account is what you owe, not what 
you own. One starts with a large initial debt — e.g., a o might be 300,000 — and each 
period, typically one month, interest adds to the debt while monthly payments are 
large enough to offset the interest and slowly reduce the debt. Closed-form solutions 
to recurrence relations lead to the formulas that bankers use to determine the monthly 
payments for a mortgage of a given size, interest rate, and term. 

Example 6: Making Change 

Find a recurrence relation for the number of different ways to hand out a piece of 
chewing gum (worth 17 ) or a candy bar (worth 10 7 ) or a doughnut (worth 207 ) on 
successive days until mf, worth of food has been given away. 

This problem can be treated similarly to the stair-climbing problem in Example 2. 
That is, if on the first day we hand out 17 worth of chewing gum, we are left with 
(n — 1)7 worth of food to give away on following days; if the first day we hand out 
107 worth of candy, we have ( n — 10)7 worth to dispense the next days; and if 207 
then ( n — 20)7 the next days. So a n = a n - 1 + a n - 10 + a „_ 20 with ao — 1 (one way to 
give nothing — by giving 0 pieces of each item), and implicitly, — O for k < 0. ■ 

Example 7: A Forbidden Subsequence 

Find a recurrence relation for a n , the number of /7-digit ternary sequences without any 
occurrence of the subsequence “012.” 

Recall that a ternary sequence is a sequence composed of 0s, Is, and 2s. We 
start with the same analysis used in the elf stair-climbing problem. If the first digit 
in an //-digit ternary sequence is 1, then there are a n -\(n — l)-digit ternary sequences 
without the pattern 012 that can follow that initial 1. Similarly if the first digit is 2. 

However, there is a problem if the first digit is a 0. Among the (n— 1 )-digit ternary 
sequences without the pattern 012 that might follow the initial 0 are sequences that 
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start with 12. While such ( n — l)-digit sequences do not contain the pattern 012, the 
n-digit sequences of 0 followed by such in — l)-digit sequences do start with the 
pattern 012. We correct this mistake by subtracting off all n-digit sequences starting 
with 012 (but with 012 not appearing later in the sequence). Such sequences are 
fonned by 012 followed by any ( n — 3)-digit ternary sequence with no 012 pattern — 
there are a „- 3 such sequences. Thus the desired recurrence relation is a„ = a n _ 1 + 
a n —\ -f~ {cin—i tin— 3 ) — 2>a n —\ Q n — 3 . ■ 

The preceding examples developed recurrence relation models either by breaking 
a problem into a first step followed by the same problem for a smaller set (Examples 
1, 2, 6 and 7) or by observing the change in going from the case of n — 1 to the case 
of n (Examples 3 and 5). Example 4 was a modified form of the former procedure in 
which the move of the nth ring (largest ring) was seen to be preceded and followed 
by an (n — l)-ring game. 

There are two simple methods for solving some of the relations seen thus far. The 
first is recursive backward substitution: wherever a n -\ occurs in the relation for a„ , we 
replace a n - 1 by the relation’s formula for a „- 1 (involving a„_ 2) and then replace a„_ 2, 
and so on. In Example 3, the relation a„ = a n - 1 + n becomes a n = (a „_ 2 + n — 1) + 
n = -- - = l + 2 + 3 + - — fn — 1+n. We used this method in Example 1 to obtain 
a n — n(n — l)(n — 2) x • • • x 2 x 1. 

The second method is to guess the solution to the relation and then to verify it 
by mathematical induction. In Example 4, we noted that a n = 2" — 1 for the first six 
values of the recurrence relation a„ = 2a n - \ + 1, cq = 1. We prove a„ = 2" — 1 by 
induction as follows. It is seen to be true for n = 1. Assuming a n _\ = 2" _1 — 1, we 
then have a„ = 2a„_i + 1 = 2(2"“' - 1) + 1 = 2" - 2 + 1 = 2" - 1 . 

We now consider more complex recurrence relations involving two variable re- 
lations and simultaneous relations. 

Example 8 : Selection Without Repetition 

Let a n j : denote the number of ways to select a subset of k objects from a set of n 
distinct objects. Find a recurrence relation for a,, *.. 

Observe that a n ^ is simply . We break the problem into two subcases based on 
whether or not the first object is used. There are a„_u- ^-subsets that do not use the first 
object, and there are a n -\ t k~\ ^-subsets that do use the first object. So we have «„ /, = 
a?i-i,k + dn-i,k-\- This is the Pascal’s triangle identity [identity (3) from Section 5.5]. 
The initial conditions are a„ Q = a n „ = 1 for all n > 0 (and a„ k = 0, k > n). ■ 

Example 9: Distributions 

Find a recurrence relation for the ways to distribute n identical balls into k distinct 
boxes with between two and four balls in each box. Repeat the problem with balls of 
three colors. 

In the spirit of the previous example, we consider how many balls go into the first 
box. If we put two in the first box (one way to do this), then there are a n -2.k-\ ways 
to put the remaining n — 2 identical balls in the remaining k— 1 boxes. Continuing 
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this line of reasoning, we see that a„ = a n _2,k~i +a n - 3,4-1 + £4-4,*;- 1 ■ The initial 
conditions are «2,i = 03,1 = 04,1 = 1 and a„i = 0. 

If now three colors are allowed, there are C(2 + 3 — 1,2) = 6 ways to pick a subset 
of two balls for the first box from three types of colors with repetition. (Review the start 
of Section 5.3 for a discussion of selection with repetition.) Similarly, there are C( 3 + 
3 — 1,3)= 1 0 ways to pick three balls from three types and C(4 + 3 — 1,4)= 15 ways 
to pick four balls from three types. Then a n ^ — 6a n _2,k~i + 10a„_3 ^_i + 15n„_4 
with initial conditions <22. 1 = 6, 03,1 = 10, £4,1 = 15, and a n \ = 0. 

The problem with all balls identical can be solved by generating functions (see 
Example 3 of Section 6.1), but recurrence relations are the only practical approach 
with the extra constraint of different types of balls. ■ 

Example 10: Placing Parentheses 

Find a recurrence relation for a n , the number of ways to place parentheses to multiply 
the n numbers k\ x k 2 x k\ x k 4 x ■ • • x k n on a calculator. 

To clarify the problem, observe that there is one way to multiply (k \ x A^), so 
«2 = 1. There are two ways to multiply k\ xl? x k\, namely, [fAq x ki_) x kj] and 
[k\ x (ki_ x Ay)]; so <23 = 2. It is not clear what a () and a \ should be, but to make 
the eventual recurrence relation have a simple form, we let ao — 0 and a\ = 1. To 
find a recurrence relation for a n , we look at the last multiplication (the outermost 
parenthesis) in the product of the n numbers. This last multiplication involves the 
products of two multiplication subproblems: 


(ki x k2 x • • • x ki ) x ( kj + \ x A (+ 2 x • • • x k n ) 


where i can range from 1 to n — 1. The numbers of ways to parenthesize the two 
respective subproblems are <2, and , and so there are «,£/„_, ways to parenthesize 
both subproblems. Summing over all i, we obtain the recurrence relation (for n >2) 
a n — a.\a n — 1 -f- Q2^n—2 T . . ■ T a n — \a \ • ■ 

Example 11: Systems of Recurrence Relations 

Find recurrence relations for 

(a) The number of n-digit ternary sequences with an even number of Os 

(b) The number of n-digit ternary sequences with an even number of Os and an even 
number of Is 

(a) We use the first-step analysis of the stair-climbing model to find a recurrence 
relation for a n , the number of /(-digit ternary sequences with an even number of Os. 
If an /(-digit ternary sequence starts with a 1, then we require an even number of 
Os in the remaining (// — l)-digit sequence — cz„_i such sequences. Similarly, if an 
/(-digit ternary sequence starts with a 2, there are a n -\ (n — l)-digit sequences with 
even Os. If the //-digit sequence starts with a 0, we require an odd number of 0s in 
the remaining n— 1 digits — 3" -1 — a„_i such sequences, since all 3 "~ l (n — l)-digit 
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sequences minus the even-Os (n — l)-digit sequences yields the odd-Os (n — l)-digit 
sequences. In sum, a„ = 2a„-\ + (3" _1 — a„_i) = a n + 3' !_1 . 

(b) We will need simultaneous recurrence relations for a n , the number of 
n-digit ternary sequences with even Os and even Is; h„ , the number of n-digit 
ternary sequences with even Os and odd Is; and c„, the number of n-digit se- 
quences with odd Os and even Is. Observe that 3" — a n — b n — c„ is the number 
of n-digit ternary sequences with odd Os and odd Is. An n-digit ternary sequence 
with even Os and even Is is obtained either by having a 1 for the first digit fol- 
lowed by an (n — l)-digit sequence with even Os and odd Is, or a 0 followed by an 
(n — l)-digit sequence with odd Os and even Is, or a 2 followed by an (n — l)-digit 
sequence with even Os and even Is. Thus a„ = b „- 1 +c„_i +a„_i. Similar anal- 
yses yield b„ = a„_ i + (3 ,!_1 - a„_i - b n _ i - c„_i) + b n _\ = 3"' 1 - c„_i and c„ = 
a „- 1 + (3" _1 — a „- 1 — b n -\ — c„_i) + c„_i = 3" _1 — b n ~\. The initial conditions are 
a\ — b\ — c\ = 1. To recursively compute values for a n , we must simultaneously 
compute b n and c n . u 

We close this section with a few words about difference equations. The first 
(backward) difference A a n of the sequence («o, ct\ , 02 , . . .) is defined to be A a n — 
a„ — a, i_i . The second difference is A 2 a n = A a n — Aa„_| = a n — 2c/ f ,_| + a„_ 2 , and 
so on. A difference equation is an equation involving a n and its differences, such as 
2 A 2 a n — 3 A a n + a„ = 0. Observe that 

a n —\ — a n (a n a n —\) — a n A a n 

a „-2 = a„-i — Afl„_i = ( a„ — A a„) — A (a„ — A a n ) = a n — 2A a n + A 2 a„(*) 

Similar equations can express a n -k in terms of a n . A a n , . . . , A k a„. Thus any re- 
currence relation can be rewritten as a difference equation, by expressing the a ( ,_/,s on 
the right-hand side of (*) in terms of a n and its differences. Conversely, by writing A a n 
as a n — a„_ 1 , and so on, any difference equation can be written as a recurrence relation. 

Difference equations are commonly used to approximate differential equations 
when solving differential equations on a computer. Difference equations have wide 
use in their own right as models for dynamical systems for which differential equations 
(which require continuous functions) are inappropriate. They are used in economics 
in models for predicting the gross national product in successive years. They are 
used in ecology to model the numbers of various species in successive years. As 
noted above, any difference equation model can also be formulated as a recurrence 
relation; however, the behavior of a dynamical system is easier to analyze and explain 
in terms of differences. Refer to Sandefur [4] for further information about difference 
equations and their applications. 

Example 12: Two- Animal Population Model 

Assume that if undisturbed by foxes, the number of rabbits increases each year by 
an amount ar n , where r„ is the number of rabbits, but when foxes are present, each 
rabbit has probability fif n of being eaten by a fox (/„ is the number of foxes). Foxes 
alone decrease by an amount yf, each year, but when rabbits are present, each fox has 
probability 8r n of feeding and raising up a new young fox (death of foxes is included 
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in the yf„ term). Give a pair of simultaneous difference equations describing the 
number of rabbits and foxes in successive years. 

The information given about yearly changes in the two populations yields the 
following difference equations: 

A r„ = ar n - /3r n f, 

A f„ = -Yfi + Sr n f n m 



7.1 EXERCISES 


Summary of Exercises The first 39 exercises call for recurrence 
relation modeling similar to that in the examples, with multiple indices and equa- 
tions required in Exercises 28^M) and difference equations in Exercises 41-43. The 
remaining exercises are more advanced problems. 

1. Find a recurrence relation for the number of ways to distribute n distinct objects 
into five boxes. What is the initial condition? 

2. (a) Find a recurrence relation for the number or ways the elf in Example 2 can 

climb n stairs if each step covers either one or two or three stairs? 

(b) How many ways are there for the elf to climb four stairs? 

3. Find a recurrence relation for the number of ways to arrange cars in a row with 
n spaces if we can use Cadillacs or Hummers or Fords. A Hummer requires two 
spaces, whereas a Cadillac or a Ford requires just one space. 

4. (a) Find a recurrence relation for the number of ways to go n miles by foot 

walking at 2 miles per hour or jogging at 4 miles per hour or running at 8 miles 
per hour; at the end of each hour a choice is made of how to go the next hour. 

(b) How many ways are there to go 12 miles? 

5. Find a recurrence relation for the number of ways to distribute a total of n cents 
on successive days using 1971 pennies, 1951 nickels, 1967 nickels, 1959 dimes, 
and 1975 quarters. 

6. (a) Find a recurrence relation for the number of n-digit binary sequences with 

no pair of consecutive Is. 

(b) Repeat for n-digit ternary sequences. 

(c) Repeat for n-digit ternary sequences with no consecutive 1 s or consecutive 2s. 

7. Find a recurrence relation for the number of pairs of rabbits after n months if (1) 
initially there is one pair of rabbits who were just born, and (2) every month each 
pair of rabbits that are over one month old have a pair of offspring (a male and a 
female). 

8. Show that the binomial sum 


n + 1 
0 


n — 1 
2 


satisfies the Fibonacci relation. 
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9. Find a recurrence relation for the number of ways to arrange n dominoes to fill 
a 2-x-n checkerboard. 

10. Find a recurrence relation for a„ for the number of bees in the nth previous 
generation of a male bee, if a male bee is born asexually from a single female 
and a female bee has the normal male and female parents. The ancestral chart at 
the right shows that a i = 1, — 2, a?, — 3. 

T I T 

/ m / 

m / 

/ 


11. Find a recurrence relation for a n for the number of ways for an image to be 
reflected n times by internal faces of two adjacent panes of glass. The diagram 
below shows that ao — 1, a\ = 2, and a 2 = 3. 


0 reflections 1 reflection 2 reflections 



12. Find a recurrence relation for the number of regions created by n mutually inter- 
secting circles on a piece of paper (no three circles have a common intersection 
point). 

13. (a) Find a recurrence relation for the number of regions created by n lines on a 

piece of paper if k of the lines are parallel and the other n — k lines intersect 
all other lines (no three lines intersect at one point). 

(b) If n = 9 and k = 3, find the number of regions. 

14. Show that each of the following rules for playing the Tower of Hanoi works. 

(a) On odd-numbered moves, move the smallest ring clockwise one peg (think 
of the pegs being at the corners of a triangle), and on even-numbered moves, 
make the only legal move not using the smallest ring. 

(b) Number the rings from 1 to n in order of increasing size. Never move the 
same ring twice in a row. Always put even-numbered rings on top of odd- 
numbered rings (or on an empty peg) and put odd-numbered rings on top of 
even-numbered rings (or on an empty peg). 

15. Find a recurrence relation for the amount of money in a savings account after 
n years if the interest rate is 6 percent and $50 is added at the start of each 
year. 
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16. (a) Find a recurrence relation for the amount of money outstanding on a $30,000 

mortgage after n years if the interest rate is 8 percent and the yearly payment 
(paid at the end of each year after interest is computed) is $3,000. 

(b) Use a calculator or computer to determine how many years it will take to pay 
off the mortgage. 

17. Each day Angela eats lunch at a deli, ordering one of the following: chicken salad, 
a tuna sandwich, or a turkey wrap. Find a recurrence relation for the number of 
ways for her to order lunch for the n days if she never orders chicken salad three 
days in a row. 

18. Find a recurrence relation for the number of n -letter sequences using the letters 
A, B, C such that any A not in the last position of the sequence is always followed 
by a B. 

19. (a) Find a recurrence relation for the number of sequences of Is, 3s, and 5s 

whose terms sum to n. 

(b) Repeat part (a) with the added condition that no 5 can be followed by a 1. 

(c) Repeat part (a) with the condition of no subsequence of 135. 

20. (a) Find a recurrence relation for the number of ways to arrange three types of 

flags on a flagpole n feet high: red flags ( 1 foot high), gold flags ( 1 foot high), 
and green flags (2 feet high). 

(b) Repeat part (a) with the added condition that there may not be three 1-foot 
flags (red or gold) in a row. 

(c) Repeat part (a) with the condition of no red above gold above green (in a 
row). 

21. Find a recurrence relation for a„, the number of ways to give away $1 or $2 or 
$3 for n days with the constraint that there is an even number of days when $1 is 
given away. 

22. Find a recurrence relation to count the number of n-digit binary sequences with 
at least one instance of consecutive 0s. 

23. Find a recurrence relation for the number of n-digit quaternary (0, 1,2,3) se- 
quences with at least one 1 and the first 1 occurring before the first 0 (possibly 
no 0s). 

24. Find a recurrence relation for the number of n-digit ternary (0, 1, 2) sequence in 
which no 1 appears anywhere to the right of any 2. 

25. Find a recurrence relation for the number of n-digit ternary sequences that have 
the pattern “012” occurring for the first time at the end of the sequence. 

26. A switching game has n switches, all initially in the OFF position. In order to be 
able to flip the ;th switch, the (i — l)st switch must be ON and all earlier switches 
OFF. The first switch can always be flipped. Find a recurrence relation for the 
total number of times the n switches must be flipped to get the nth switch ON 
and all others OFF. 
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27. Find a recurrence relation for the number of ways to pair off 2 n people for tennis 
matches. 

28. Find a recurrence relation for the number of ways to pick k objects with repetition 
from n types. 

29. Find a recurrence relation for the number of ways to select n objects from k types 
with at most three of any one type. 

30. Find a recurrence relation for a n k , the number of ways to order n doughnuts from 
k different types of doughnuts if two or four or six doughnuts must be chosen of 
each type. 

31. Find a recurrence relation for the number of partitions of the integer n into k 
parts. 

32. Find a recurrence relation for the number a„ t „ h k of distributions of n identical 
objects into k distinct boxes with at most four objects in a box and with exactly 
m boxes having four objects. 

33. Find a system of recurrence relations for computing the number of n-digit binary 
sequences with an even number of Os and an even number of Is. 

34. Find a system of recurrence relations for computing the number of n-digit qua- 
ternary sequences with 

(a) An even number of Os 

(b) An even total number of Os and Is 

(c) An even number of Os and an even number of Is 

35. Find a system of recurrence relations for computing the number of n-digit binary 
sequences with exactly one pair of consecutive Os. 

36. Find a system of recurrence relations for the number of n-digit binary sequences 
with k adjacent pairs of Is and no adjacent pairs of Os. 

37. Find a system of recurrence relations for computing the number of ways to 
hand out a penny or a nickel or a dime on successive days until n cents are 
given such that the same amount of money is not handed out on two consecutive 
days. 

38. Find a recurrence relation for the number of ways to pair off 2 n points on a circle 
with nonintersecting chords. (Hint: The recurrence involves products of «*-s as 
in Example 11.) 

39. Find a recurrence relation for the number of ways to divide an n- gon into triangles 
with noncrossing diagonals. 

40. Find a recurrence relation for the number of binary trees with n labeled leaves. 

41. Find A a„ and A 2 a„ if a„ equals 

(a) 3n + 2 (b) n 2 (c) n 3 

42. Let /„ be the amount of food that can be bought with n dollars. Let p n be the 
“perceived” value of the %n of food. Suppose the increase in perceived value with 
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$1 more of food equals the relative, or percentage, increase in the actual amount 
of food. Find a difference equation relating A p n and A /„. 

43. (a) Find a recurrence relation for the number of permutations of the first n integers 

such that each integer differs by one (except for the first integer) from some 
integer to the left of it in the permutation. What is the initial condition? 

(b) Solve the relation in part (a) by guessing and verifying the guess by induction. 

(c) Give a direct combinatorial answer to this problem. 

44. (a) Find a recurrence relation for f(n, k), the number of k subsets of the integers 

1 through n with no pair of consecutive integers. 

n /2 

(b) Show that 'E k J 0 f(n, k) = F n+ \, the (n + l)st Fibonacci number (n even), 
where Fq = F\ = 1 . 

45. Find a system of recurrence relations for computing a n , the number of (un- 
ordered) collections of (identical) pennies, (identical) nickels, (identical) dimes, 
and (identical) quarters whose value is n cents. 

46. Find a recurrence relation for the number of ways a coin can be flipped 2 n times 
with n heads and n tails and 

(a) The number of heads at any time never be less than the number of tails 

(b) The number of heads equal the number of tails only after all 2 n flips 

47. Find a recurrence relation for the number of incongruent integral-sided triangles 
whose perimeter is n (the relation is different for n odd and n even). 

48. Find a system of recurrence relations for computing the number of spanning trees 
in the “ladder” graph with 2 n vertices. 



49. Verify the following identities for Fibonacci numbers (F) is the zth Fibonacci 
number) by induction or combinatorial argument. Here Fq = F\ = 1 . 

n n 

(a) F ' = F »+i ~ 1 0)) F? = F n F n+ 1 

i=0 i=0 

n 

(C) F ™ = F 2«+i (d) F n F n+ 2 = F„ 2 +i + (-1)" 

*=0 

(e) F\ — F 2 + Fi F ln = —F 2 „- i 

50. (a) Show that F n+m = F m F n + F m _ ] F n _ x . 

(b) From part (a) conclude that F„_i divides 1 . 
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7.2 DIVIDE-AND-CONQUER RELATIONS 

In this section we present a special class of recurrence relations that arise frequently in 
the analysis of recursive computer algorithms. These are algorithms that use a “divide- 
and-conquer” approach to recursively split a problem into two subproblems of half the 
size. The dictionary search in Example 3 of Section 3. 1 is such an algorithm. A binary 
tree is explicitly or implicitly associated with most “divide-and-conquer” algorithms. 
Conversely, many counting problems involving trees (the subject of Chapter 3) are 
most easily solved with divide-and-conquer recurrence relations. 

The total number of steps a n required by a divide-and-conquer algorithm to 
process an n-element problem frequently satisfies a recurrence relation of the form 

a n = ca n / 2 + f(n) (1) 

The following table indicates the form of the solution of (1) for some common values 
of c and /(«) ([>] denotes the smallest integer m with m > r): 


c 

f(n) 

a„ 

c = 1 

d 

d\ "log 2 nl + A 

c = 2 

d 

An — d 

c > 2 

dn 

"'■ + ( 2 - c )" 

c = 2 

dn 

dn{\\og 2 n'] + A) 


The constant A is to be chosen to fit the initial condition. 

If a problem is recursively split into k parts instead of two parts, then one should 
replace 2 by k everywhere in the foregoing table, except the solution for c — k and 
f(n) = d becomes An — d /(k — 1). For example, the recurrence relation 


has the solution 


a„=ca n /k + dn c^k 
a n = An'° StC + n 


( 2 ) 


The solutions fore/,, given in the foregoing table are easily verified by substitution. 
Consider the case a n — ca n/ n + dn, c > 2. Substituting the table’s solution of a„ = 
An loglC + [2d/(2 — c)]n into ca n / 2 + dn, we have 


ca n /2 + dn = c 


, n \ lo g 2 c 
A - 


(!) 


2d \ n 


2 -c 2 


■ dn 


cAn l ° S2C i edn > (2 — c)dn 
2 log 2 e + 2-c + 2-c 

cAn log2C cdn + (2 — c)dn 


2-c 


= An Xog2 


2d 

2-c 
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The following examples illustrate such “divide-and-conquer” recurrences and 
their solution. 

Example 1: Rounds in a Tournament 

In a tennis tournament, each entrant plays a match in the first round. Next, all winners 
from the first round play a second-round match. Winners continue to move on to the 
next round, until finally only one player is left — the tournament winner. Assuming that 
tournaments always involve n = 2 k players, for some k, find and solve a recurrence 
relation for the number of rounds in a tournament of n players. 

In terms of binary trees, a n is the height of a balanced binary tree with n = 2 k 
leaves. Since half the players are eliminated in each round, the number of rounds 
increases by 1 when the number of players doubles. The recurrence relation for a n , 
the number of rounds, is thus 


O n — &n / 2 + 1 

From the foregoing table, the solution of this recurrence relation is a n = log 2 n + A. 
To determine A, we observe that 0 = = log 2 1 + A = 0 + A, and so A = 0. ■ 

Example 2: Finding the Largest and Smallest Numbers in a Set 

Build a recurrence relation model to count the number of comparisons that must be 
made in the following algorithm for finding the largest number l and the smallest 
number ^ in a set S of n distinct integers. Then solve this recurrence relation. 

Initially suppose that n is an even number. Assume that we have already found 
/ 1 and ,V| , the largest and smallest numbers, respectively, in the first half of S (the 
first n/ 2 numbers) and have found / 2 and ,v 2 , the largest and smallest numbers in the 
second half of S. Then make two comparisons, one between l\ and h and the other 
between ,V| and ,v 2 , to find the largest number / and smallest number ,v in S. 

The associated recurrence relation for the number of comparisons in this proce- 
dure is a n — 2 a n n + 2, for n > 4 and even. If n is odd and we split S almost equally, 
the relation would be a n = a ( „ +i) / 2 + ci(n-i )/2 + 2. Observe that ct\ = 0, since the one 
number is both largest and smallest. And n 2 = 1, since we can determine the larger 
and smaller number in a two-element set with one comparison. With these two rela- 
tions along with a\ and a 2 , we can recursively determine the number of comparisons 
needed for any n. 

Next we solve the recurrence relation a„ = 2a n p_ + 2, n > 4, with n 2 = 1. The 
foregoing table tells us that the solution will be of the form a„ = An — 2. We confirm 
this by substituting a n = An — 2 into both sides of the relation a„ = 2a n p + 2 : 

An — 2 = a„ = 2a„/2 + 2 = 2(A — — 2) + 2 

= An — 4 + 2 = An — 2 

We can use the initial condition a 2 = 1 to determine A: 


3 

l=a 2 = Ai(2) — 2 or Ai = - 
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So a n = \n — 2 is the number of comparisons needed to find the largest and 
smallest number according to the procedure given previously. (It is possible to prove 
that one cannot do better than — 2 comparisons.) ■ 


Example 3: Efficient Multidigit Multiplication 

Normally one must do n 2 digit-times-digit multiplications to multiply two n-digit 
numbers. Use a divide-and-conquer approach to develop a faster algorithm. 

Let us initially assume that n is a power of 2. Let the two n-digit numbers be g 
and h. We split each of these numbers into two n/2-digit parts: 

g = g,10' ,/2 + g 2 /? = h l 10 n/2 + h 2 


Then 


8 x h = (gi x /? i ) 1 0” + (gi x h 2 + gi x /zi)10" /2 + g 2 x h 2 (3) 

Observe that 


gi x h 2 +gi x h i = (gi + g 2 ) x (hi+h 2 )-g i x /ii -g 2 x /; 2 


and so we need to make only three n/2-digit multiplications, g\ x hi, go x ho. and 
(gi + g 2 ) x {hi + h 2 ) to determine g x h in (3) (actually (gi + g 2 ) or (hi + h 2 ) may be 
(n /2 + l)-digit numbers, but this slight variation does not affect the general magnitude 
of our solution). If a„ represents the number of digit-times-digit multiplications needed 
to multiply two n-digit numbers by the foregoing procedure, then the procedure yields 
the recurrence a„ — 3a n / 2 . 

By the table at the start of this section (where d — 0), a„ is proportional to 
n log 2 3 = n L6 , a substantial improvement over n 2 . ■ 


In some settings, we are not so interested in an exact formula for a„ as we are 
in the general rate of growth for a n . The following theorem from Cormen, Leiserson, 
and Rivest [2] gives bounds on such growth. 

Theorem 

Let a„ — ca n /k + fin) be a recurrence relation with positive constant c and the positive 
function fin). 

(a) If for large n. f{n) grows proportional to n log ‘ e [that is, there are positive 
constants p and p' such that pn l ° Skr < fin) < p'n l ° StC ], then a„ grows proportional 
to n log * c log 2 c. 

(b) If for large n, fin) < pn q , where p is a positive constant and q < log /( c, 
then a n grows at most at a rate proportional to n l ° Sk c . 


7.2 Divide-and-Conquer Relations 299 


1.2 EXERCISES 


1. Solve the following recurrence relations assuming that n is a power of 2 (leaving 
a constant A to be determined): 


(a) 



+ 5 

(b) 


= 

+ n 

(c) 


= a „/2 + 2 n — 

(d) 

a n 

— 3fl„/3 

+4 

(e) 

Cln 

= 16a„/2 + 5n 

(f) 

G n 

— 4a„/2 

+ 3 n 


2. Find and solve a recurrence relation for the number of matches played in a 
tournament with n players, where n is a power of 2 . 

3. In a large corporation with n salespeople, every 10 salespeople report to a local 
manager, every 10 local managers report to a district manager, and so forth 
until finally 10 vice-presidents report to the firm’s president. If the firm has n 
salespeople, where n is a power of 10 , find and solve recurrence relations for 

(a) The number of different managerial levels in the firm 

(b) The number of managers (up through president) in the firm 

4. In a tennis tournament, each player wins k hundreds of dollars, where k is the 
number of people in the subtournament won by the player (the subsection of the 
tournament including the player, the player’s victims, and their victims, and so 
forth; a player who loses in the first round gets $100). If the tournament has n 
contestants, where n is a power of 2 , find and solve a recurrence relation for the 
total prize money in the tournament. 

5. Consider the following method for rearranging the n distinct numbers 
x\, X 2 , . . . , x n in order of increasing size (n is a power of 2). Pair the integers off 
[xi, X 2 1 . {x 3 , X 4 I, and so forth. Compare each pair and put the smaller number 
first. Next pair off the pairs into sets of four numbers and merge the ordered pairs 
to get ordered 4-tuples. Continue this process until the whole set is ordered. Find 
and solve a recurrence relation for the total number of comparisons required to 
rearrange n distinct numbers. {Hint: First find the number of comparisons needed 
to merge two ordered ^-tuples into an ordered 2A:-tuple). 

6 . In a standard elimination tournament, a player wins $100A' when he/she wins a 
match in the Mi round (e.g., first round win earns $100, second round win $200). 
Develop and solve a recurrence relation for a„ , the total amount of money given 
away in a tournament with n entrants, where n is assumed to be a power of 2 . 

7. Verify by substitution the form of solution given in the text to the following 
recurrence relations: 

(a) a„ = a n/2 + d 
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(b ) a n — 2a n /2 T d 

(c) a„ = 2 a n /2 + dn 

8. (a) Use a divide-and-conquer approach to devise a procedure to find the largest 

and next-to-largest numbers in a set of n distinct integers. 

(b) Give a recurrence relation for the number of comparisons performed by your 
procedure. 

(c) Solve the recurrence relation obtained in part (b). 

9. (a) Use a divide-and-conquer approach to devise a procedure to find the largest 

number in a set of n distinct integers. 

(b) Give a recurrence relation for the number of comparisons performed by your 


procedure. 

(c) Solve the recurrence relation obtained in part (b). 



7.3 SOLUTION OF LINEAR 

RECURRENCE RELATIONS 

In this section we show how to solve recurrence relations of the form 

a n = C\a n —i "H C 2 a n —2 T * ■ ■ T c r a n — r (1) 

where the c, s are given constants. There is a simple technique for solving such rela- 
tions. Readers who have studied linear differential equations with constant coefficients 
will see a great similarity between their solution and the form of solutions we discuss 
here. The general solution to ( 1 ) will involve a sum of individual solutions of the form 
a„ — a". To determine what a is, we simply substitute a k for a* in (1), yielding 

a n = cia" -1 + C 20 t' l ~ 2 H f c r a n ~ r (2) 

We can reduce the power of a in all terms in (2) by dividing both sides by a"~ r : 

a r = c\a r ~ x + C 2 U r ~ 2 H b c r (3) 

or, equivalently, 

a' — c\a ‘ _1 — C 2 & 1 -2 — c r — 0 (4) 

Equation (4) is called the characteristic equation of the recurrence relation (1). 
It has r roots, some of which may be complex (but we shall initially assume that there 
are no multiple roots). 

If a i, « 2 , are the r roots of (4), then, for any i. 0 < i < r, a n = a " is a 
solution to the recurrence relation (1). It is easy to check that any linear combination 
of such solutions is also a solution (Exercise 8). That is, 

a n = Aioi'y + A 2012 + • • • + A r a" (5) 

is a solution to (1), for any choice of constants A/, 1 < i < r. 
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Recall that for a recurrence relation involving a n _ | . a„_ 2 , . . . , on the right 
side, we need to be given the initial conditions of the first r values a\), a \ , ut, . . . , a r -\. 
Let us denote such a set of initial values by a' 0 , a\ . a' r _ l . Then the A,s must 

be chosen to satisfy the r constraints: 

Aid!) + A 2 Q !2 H b A r a k r —a' k 0<k<r—\ (6) 

The r linear equations in (6) can be solved by Gaussian elimination to determine 
the r constants A, (remember that at this stage the «,s are known). With the A, ’s 
determined, we will have the desired solution for a n , a solution that satisfies (4), and 
hence the recurrence relation (1), and satisfies the initial conditions oq = a' 0 ,a\ = 
a[, ... , a r _ i = a' r l . If the solution of the characteristic equation (4) has a root a* of 
multiplicity m, then a", not", n 2 a ", . . . , n (m ~ 1 ’a" can be shown to be the m associated 
individual solutions to be used in (5) and (6). 


Example 1: Doubling Rabbit Population 

Every year Dr. Finch’s rabbit population doubles. He started with six rabbits. How 
many rabbits does he have after eight years? After n years? 

Iffl„ is the number of rabbits after n years, then a„ satisfies the relation a n = 2a n -\ . 
We are also given ao = 6. Substituting a„ — a " , we obtain a" — 2a"' 1 or, dividing by 
a" -1 , a — 2. So a„ = 2" is the one individual solution, and a„ = A2" is the general 
solution. The initial condition is 6 = ao — A2°, or A = 6. The desired solution is then 
a n — 6 x 2". After eight years, we have = 6 x 2* = 6 x 256 = 1536 rabbits. ■ 

Example 2: Second-Order Linear Recurrence Relation 

Solve the recurrence relation ci n = 2a„-i + 3a„-2 with ao = a i = 1. 

Setting a„ = a' 1 , we get the characteristic equation 

a " = 2 a "- 1 + 3 a "- 2 


which yields 


a 2 = 2a + 3 


This may be written a 2 — 2a — 3 = 0 or (a — 3 )(a + 1) = 0. That is, the roots are +3 
and — 1. So the basic solutions of the recurrence relation are a n = 3" and a n = (— 1 )" , 
and the general solution is 


a„ = A 1 3" + A 2 (-l)' ! 


Now we determine A i and A 2 by using the initial conditions 

1 = ciq — Ai3° + A 2 (— 1)° = Aj + A 2 
1 = ai = Ai3 l + A 2 (— l) 1 = 3Ai - A 2 

We solve these two simultaneous equations to obtain A\ = A 2 = | (add the two 
equations together to eliminate A 2 yielding 2 = 4A 2 or A\ = and then determine 
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At). The required solution to the recurrence relation with the given initial conditions is 

a n = ^x3 n + ^x(-l) n . 

Example 3: Solution of Fibonacci Relation 

Find a formula for the number of ways for the elf in Example 2 of Section 7.1 to 
climb the n stairs. 

The recurrence relation obtained in Example 2 of Section 7. 1 was the Fibonacci 
relation a n — a„_i + a„_ 2, with the initial conditions a\ — \,a 2 = 2, or equivalently, 
a 0 = a 1 — 1. The associated characteristic equation is obtained by setting a n = a n : 


which reduces to 


a 2 = a + 1 or a 2 — a — l — 0 
Using the quadratic formula, we get 

«=^[- ( - 1) ±V ( - 1)2 - 4(1)( - 1)] =^ (1 ±v / 5 ) 

That is, we have roots 2 + 3 V5 and \ | V5, and the general solution of the problem 

is 



J_ ( i-Vs 
VsV 2 

note the surprising fact that to generate the Fibonacci sequence of integers 1, 1, 2, 3, 
5, 8, 13, ... , we need powers of ^ + j-s/5^ and ^ ■ 

Optional: The following example illustrates a solution of a recurrence relation 
that has complex and multiple roots. 

Example 4: Complex and Multiple Roots 

Find a formula for a n satisfying the relation a„ = — 2a n -2 — a„_ 4 with ao = 0, a\ = 
1, «2 = 2, and <33 = 3. 

Substituting a„ = a", we obtain a" = —2 a" -2 — a"~ 4 , which yields the charac- 
teristic equation a 4 + 2a 2 + 1 = (a 2 + l) 2 = 0. The roots of this equation are a = +i 
and a — —i (where i = \J — 1 ) and each root has multiplicity 2. Recall that when a is 
a double root, the associated recurrence relation solutions are a n — a" and a n = na n . 
So the general solution is 

a n = A\i n + A 2 ni n + A 3 (-if + A 4 n(- 1)' ! 



It is left as an exericse to show that A\ = y 1+ /^ j and A 2 = 
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The initial conditions yield the equations 

0 = c/ 0 = Af° + A 2 0i° + A 3 (-i)° + A 4 0(-i)° 

= A 1 +0+A 3 +0 

1 = Oi = Ait 1 + A2I/ 1 + A 3 (— i) 1 + A4l(— ;)' 

= i(Ai + A2 — A 3 — A4) 

2 = a 2 = A 1 r + A 2 2t 2 + A 3 (-/) 2 + A 4 2(-t) 2 

= — A \ — 2Ai — A3 — 2A\ 

3 = a 3 = Ait 3 4 5 + A 2 3t 3 + A 3 (— t) 3 + A 4 3(-/) 3 

= i(—A\ — 3A 2 + A 3 + 3A4) 

Solving these four simultaneous equations in four unknown A, ’s, we obtain 

3 13 1 

Ai = — i At = |-i A 3 — —i A 4 — i 

2 ~ 2 2 2 

Then the solution of the recurrence relation is 

a n — —~i" +l + - + i \ ni" + n(—i) n ■ 

We remind the reader that for specific values of n, such as n= 12, it is eas- 
ier to determine a\i in the two preceding examples by recursively calculating 
a 3 , c/4, c/5 up to c/12 from the recurrence relation than to solve the initial-condition 
equations. 



7.3 EXERCISES 


1. If $500 is invested in a savings account earning 8 percent a year, give a formula 
for the amount of money in the account after n years. 

Find and solve a recurrence relation for the number of n-digit ternary sequences 
with no consecutive digits being equal. 

Solve the following recurrence relations: 

(a) a n = 3a„_i + 4 a„_ 2 , a 0 — ai = 1 

(b) a n = a n — 2 , ao = a\ = \ 

(c) a n — 2a„_i - a n - 2 , aQ — ax— 2 

(d) a„ — 3a„-x — 3a„_2 + a«- 3 , «o = «i = 1, «2 = 2 


4. Determine the constants Ai and Ai in Example 3. First show that the initial 
conditions a 1 = I , « 2 = 2 are equivalent to the initial conditions ao = 1, a\ — 1. 

5. Find and solve a recurrence relation for the number of ways to arrange flags on 
an //-foot flagpole using three types of flags: red flags 2 feet high, yellow flags 1 
foot high, and blue flags 1 foot high. 
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6. Find and solve a recurrence relation for the number of ways to make a pile of 
n chips using red, white, and blue chips and such that no two red chips are 
together. 

7. Find and solve a recurrence relation for p n , the value of a stock market indicator 
that obeys the rule that the change this year (from the previous year) equals twice 
last year’s change. Suppose po = 1, p\ = 4. 

8 . Show that any linear combination of solutions to (1) is itself a solution to (1). 

9. Show that if the characteristic equation (4) has a root a* of multiplicity 3, then 
n' a". for j — 0 , 1 , 2 , are solutions of ( 1 ). 

10 . Show that if F„ is the nth Fibonacci number in the Fibonacci sequence starting 
F 0 = F\ = 1, then 

.. F„ + i 1 + V5 


11 . If the recurrence relation a n = C\a n - \ + c 20 , 1-2 has a general solution a„ = 
Ai3 n + A 26 " , find ci and C 2 . 



7.4 SOLUTION OF INHOMOGENEOUS 
RECURRENCE RELATIONS 

A recurrence relation is called homogeneous if all the terms of the relation involve 
some au- In the preceding section, we presented a method for solving any homoge- 
neous linear recurrence relation a„ = c\a n _\ + C 2 d n -2 H — • + c, a„_ r . When an addi- 
tional term involving a function of n (or a constant) appears in a recurrence relation, 
such as 


a n = ca„-i + fin) (1) 

then the recurrence relation is said to be inhomogeneous. 

In this section, we discuss methods for solving inhomogeneous recurrence re- 
lations of the form of (1). The key idea in solving these relations is that a general 
solution for an inhomogeneous relation is made up of a general solution to the as- 
sociated homogeneous relation [obtained by deleting the f(n ) term] plus any one 
particular solution to the inhomogeneous relation. 

For (1), the homogeneous relation is a n =ca n - 1 , whose general solution is 
a n = Ac" . Suppose a* is some particular solution of (1) — that is, a* = ca*_ l + f{n). 
Then we see that a n = Ac" + a* satisfies (1): 

a„ = Ac" + a* 

= Ac" + [ ca „*_! + f{n)] 

= c(Ac"~ l + a *_ ! ) + fin) = ca n - 1 + fin ) 
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The constant A in the general solution is chosen to satisfy the initial condition, as in 
the previous section ( A cannot be determined until a* is found). 

There is one special case of (1) that we can restate as an enumeration problem 
treated in previous chapters. If c = 1, then (1) becomes 

a n = a n - 1 + fin) (2) 

We can iterate (2) to get 

a\ = a 0 + /(l) 

a 2 = fl 1 + /(2) = [a 0 + /(l)] + /(2) 
a 3 =a 2 + /( 3) = [oo + /( 1) + /( 2)] + /( 3) 

n 

o n = «o + /( 1) + /(2) + /(3) H — • + fin) = uq + / (A:) 

*=t 

So a„ is just the sum of the f(k)s plus a f] . In Sections 5.5 and 6.5 we pre- 
sented methods for summing functions of n. Either method can be used to solve 
( 2 ). 


Example 1: Summation Recurrence 

Solve the recurrence relation a n +n with initial condition a \—2, obtained 

in Example 3 of Section 7.1, for the number of regions created by n mutually 
intersecting lines. 

The initial condition of a\ = 2 can be replaced by the initial condition ao = 1 
(no lines means that the plane is one big region). By the foregoing discussion, we 
see that a n = 1 + ( 1 + 2 + 3 + • — f n). The expression to be summed can be written 
as 



By identity (7) of Section 5.5, this sum equals ("^') = \n{n + 1). Then a„ = 1 + 

+ 1 ). ■ 


When c 1 in (1), there are known solutions to (1) to use for specific functions fin), 
similar to the situation for “divide-and conquer” recurrences presented in Section 
7.2. We present a table for the simplest f(n) s. These solutions can be derived by 
generating function methods introduced in the next section. 


fin) 

Particular solution p(n) 

d, a constant 

B 

dn 

B\n + Bq 

dn 2 

B 2 n 2 + Bpi + B 0 

ed" 

Bd" 
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The B s are constants to be determined. If f(n) were a sum of several different terms, 
we would separately solve the relation for each separate f(n) tenn and then add these 
solutions together to get a particular solution for the composite f(n). 

There is one case in which the particular solution for f(n) = ed" will not work. 
This involves the relation a„ — da n _\ +ed n — note here that the homogeneous so- 
lution a n — Ad n has the same form as fin). Then one must try a* = Bnd n as the 
particular solution. 


Example 2: Solving the Tower of Hanoi Puzzle 

Solve the recurrence relation a n = 2a n _ i + 1 with a\ = 1 obtained in Example 4 of 
Section 7.1 for the number of moves required to play the n-ring Tower of Hanoi 
puzzle. The general solution to the homogeneous equation a n — 2a„-\ is a„ = A2". 

We hnd a particular solution to the inhomogeneous relation by setting a* = B 
[this is the form of a particular solution given in the foregoing table when f(n) is a 
constant]. Substituting in the relation, we have 

B=a* = 2a„*_ 1 + l=2fi+l or B = - 1 

So a* — — 1 is the particular solution, and the general inhomogeneous solution is 
a n = A 2" + a* = A2 n — 1. We now can determine A from the initial condition: 1 = 
fli = A 2 1 — 1, or 1 =2 A — 1. Hence A = 1, and the desired solutionis a n = 2" — 1. m 


Example 3: Compound Inhomogeneous Term 

Solve the recurrence relation a„ = 3a„_i — 4 n + 3 x 2" to find its general solution. 
Also find the solution when a\ = 8. 

The general solution to the homogeneous equation a n — 3a„-i is a„ = A3". We 
solve for a particular solution of the relation separately for each inhomogeneous term. 
For a n = 3a„_i — An, we try the form a* — Bpi + B t) , obtaining 

B\n + B 0 = a* = 3 a*_ l - An = 3[B\(n - 1) + B 0 ]-An. (4) 

We now equate the constant terms and the coefficients of n on each side of (4): 

Constant terms: Bq = —3B\ + 3Bq (5) 

n terms: B\n — 3B\n — An or B\=3B\—A (6) 

Solving for B\ in (6), we obtain B\ = 2. Substituting B\ = 2 in (5), we obtain Bq — 3. 
So a* — 2n + 3 is a particular solution of a„ = 3a n _i — An. 

Next for a„ = 3a„_i + 3 x 2", we try = 112" , obtaining 

B2 n = a+ = 3 a+_ x + 3x2"= 3(B2 n ~ 1 ) + 3x2" (7) 

Dividing both sides of (7) by 2" _1 , we get 2B = 35 + 6, or B — —6. So = — 6 x 2" 
is a particular solution of a n — 3a„_[ +3 x 2". Combining our particular solutions 
with the general homogeneous solution, we obtain the general inhomogeneous 
solution 


a n = A3" + 2n + 3 — 6 x 2' 
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When a i = 8, we can determine A: 

8 = fll = A3 1 +2(l) + 3-6x2 1 
= 3A-7 

Thus A = 5, and the solution is 5 x 3" + 2n + 3 — 6 x 2" . ■ 


7.4 EXERCISES 


1. Solve the following recurrence relations: 

(a) a„ — a„-\ + 3 (n - 1), a 0 = 1 

(b) a„ = a n - 1 + n(n — 1), ao = 3 

(c) a n — a„_ i + 3 n 2 3 4 5 6 7 8 9 , ao — 10 

2. Find and solve a recurrence relation for the number of infinite regions formed by 
n infinite lines drawn in the plane so that each pair of lines intersects at a different 
point. 

3. Find and solve a recurrence relation for the number of different square subboards 
of any size that can be drawn on an n x n chessboard. 

4. Find and solve a recurrence relation for the number of different regions formed 
when n mutually intersecting planes are drawn in three-dimensional space such 
that no four planes intersect at a common point and no two planes have parallel 
intersection lines in a third plane. [Hint: Reduce to a two-dimensional problem 
(Example 1).] 

5. Find and solve a recurrence relation for the number of regions into which a convex 
u-gon is divided by all its diagonals, assuming no three diagonals intersect at a 
common point. (Hint: Sum the inhomogeneous term using a special case of an 
identity from Section 5.5.) 

6. If the average of two successive years’ production |(a„ + is 2n + 5 and 
ao — 3, find a„. 

7. Solve the recurrence relation a n = I ,04a„_ t + 1 00. a () = 0, from part (b) of 
Example 5 in Section 7.1. 

8. Suppose a savings account earns 5 percent a year. Initially there is $1,000 in the 
account, and in year k. $ I Ok are withdrawn. How much money is in the account 
at the end of n years if 

(a) Annual withdrawal is at year’s end? 

(b) Withdrawal is at start of year? 

9. Solve the following recurrence relations: 

(a) a„ = 3a„_i — 2, ao = 0 (c) a„ = 2a n -\ + n, «o = 1 

(b) a n — 2a„_i + (—1)” , ao — 2 (d) a„ = 2a„-\ + 2 n 2 , ao — 3 
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10 . Solve the recurrence relation a„ = 3«„_i + n 2 — 3, with ao = I . 

11 . Solve the recurrence relation a„ = 3«„_i — 2a„_2 + 3, a () = a\ = 1. 

12 . Find and solve a recurrence relation for the number of «-digit ternary sequences 
in which no 1 appears to the right of any 2. 

13 . Find and solve a recurrence relation for the earnings of a company when the rate 
of increase of earnings increases by $10 x 2 k in the Mi year from the previous 
year, where a o = 20 and ci\ = 1020. 

14 . Show that the general solution to any inhomogeneous linear recurrence relation 
is the general solution to the associated homogeneous relation plus one particular 
inhomogenous solution. 

15 . Show that the form of the particular solution of (1) given in the table in this 
section is correct for 

(a) f(n) — d (c) f(n) — dn 2 

(b) / («) = dn (d) f(n) — d n 

16 . Show that if fin) in (1) is the sum of several different terms, a particular solution 
for this f(n) may be obtained by summing particular solutions for the individual 
terms. 

17 . Find a general solution to a n — 5a„_i + 6«„_2 = 2 + 3 n. 

18 . If the recurrence relation a n — c\a n -\ + C2+1-2 = C3« + C4 has a general solution 
a„ = Ai2" + A25" + 3/2 — 5, find c\, C2, C3, c\. 

19 . Solve the following recurrence relations when ao = 1 
(a) a 2 = 2a 2 + 1 (Hint: Let b n = a 2 .) 


(b) a n — —na„-i + n ! [Hint: Define an appropriate b n as in part (a).] 



7.5 SOLUTIONS WITH GENERATING FUNCTIONS 

Most recurrence relations for a„ can be converted into an equation involving the 
generating function g(x) = ao + a \ x H — • + a H x" H — •. This associated functional 
equation for g(x ) can often be solved algebraically and the resulting expression for 
g(x) expanded in a power series to obtain a„ as the coefficient of x n . Some of the 
algebraic manipulations of the functional equations may be new to the reader. 

We will treat g(x) as if it were a standard single variable, such as y. and treat other 
functions of x as constants. For example, the functional equation g(+) = x 2 g(x) — 2x 
can be solved by rewriting the equation as g(x)(l — x 2 ) = —2x and hence g(x) = 
—2x(l — x 2 ) -1 . Similarly, the functional equation 

(1 - x 2 )[g(x)] 2 - 4 xgix) + 4x 2 = 0 (1) 

can be solved by the quadratic formula that we normally apply to equations such 
as ay 2 + by + c = 0. Now a = (1 — x 2 ), b = — 4x, and c — 4x 2 . Intuitively, for each 
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particular value of x, g(x) is the solution of (1). Thus, by the quadratic formula, the 
solution to (1) is 

g(x) = z— ■ — y-[4x ± yj 16x 2 — 16x 2 (l — x 2 )] = — — ^ — ^-(4x ± 4x 2 ) 

2(1 — x z ) 2(1 — x z ) 

So g(x) = 2(x + x 2 )/(l — x 2 ) or 2(x — x 2 )/(l — x 2 ). If there are two (or more) 
possible solutions, only one will normally make sense as a generating function 
for a n (e.g., have a power series expansion with the correct value for the initial 
condition ao). 

Now let us show by example how some of the recurrence relations obtained in 
Section 7.1 can be converted into functional equations for an associated generating 
function. 


Example 1: Summation Recurrence 

Find a functional equation for g(x) = a () + d\X + ■ — f a„x" H — • where a n satisfies 
the recurrence relation a n = a n - \ + n, when n > 1, obtained in Example 3 of Section 
7.1. The initial condition was uq = 1. Solve the functional equation and expand g(x) 
to find a„ . 

Using this recurrence relation for every term in g(x) except «o, we have a„x“ = 
a n _\X n + nx" , n > 1 . Summing the terms, we can write 

oo oo 

g(x) -a o = ^ a n x n = ^(a„_ ix" + nx' 1 ) (2) 

n = 1 n = 1 

oo oo 

= x ^ a n -\x n ~ x + ^ nx n (3) 

n= l «=i 

OO OO / \ 

= x ^2 a n,X m + ^ ” )x" (4) 

m = 0 n= 0 ^ ' 

= xg(x) + (5) 

(i -x) z 

Line (3) is obtained from line (2) by breaking up the sum of the two x" terms into 
sums of each term, and by rewriting as xa n - ix" -1 (in order to make the power 

of x correspond with the subscript of Line (4) is obtained from line (3) by 

reindexing the first sum with m = n — 1, and by adding the “phantom” (zero) term 
0x° to the second sum and rewriting n as C(n, 1). The first series is the generating 
function g(x) multiplied by x. The second series has a generating function obtained 
by the construction presented in Section 6.5. Equating line (5) with g(x) — [the 
left side of line (2)] and setting oq= 1, we have the required functional equation for 
g(x ): 


X 


g(x) - 1 = xg(x) + 


(1-x) 2 


( 6 ) 
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Solving for g{x), we rewrite (6) as 

x x 

g(x)~xg(x)= 1 + - -r or g(x)(l -x) = 1 + — 

(1 — X ) 1 (1 — x) z 

Thus 

sM =(ih) + (rh? 

The coefficient of x" in (1 — x) -1 is just 1 and in x(l — x) -3 is C((n — ■ 1) + 3 — 
1, n — 1) = C(n + 1, n — 1) = C(n + 1, 2). Then 


g(x) = 


1 


(1-x) (1 x) 3 


=E*"+E 


n = 0 


n= 0 


« + 1 


E 

n= 0 L 


1 + 



X 


n 


and so a„ = 1 + C(n + 1 , 2) — the same answer as obtained for this recurrence relation 
in Example 1 of Section 7.4. ■ 


Example 2: Fibonacci Relation 


Use generating functions to solve the recurrence relation a n = a„_ \ + a„_ i, with a\ = 
1, fl 2 = 2 obtained in Example 2 of Section 7.1. 

The initial conditions, a i = 1, a 2 — 2, are equivalent to ao = 1, a\ = 1. Then 
using the same power series summation approach as in the previous example, we 
obtain 


n= 2 


g(x) — ao — a\x = a n x" = ^\a„- \x n + a n - 2 x") 

n = 2 

oo oo 

E «_i 2 \ v n 

a n - \X +X 2_, a n-2X 


n = 2 


n= 2 


— x a m x m + x 2 a^-x^ 

m= 1 it=0 

= Jc[g(^)-ao] + JC 2 g(^) 


Setting ao = 1 and ai = 1, we have the functional equation 

g(x)-l-x=x[g{x)-l] + x 2 g(x) or g(x) xg(x) x 2 g(x) 1—0 
and so 

g(x)(l — x — x 2 ) = 1 or g(x) = 1/(1 — x — x 2 ) 

Observe that this denominator is closely related to the characteristic equation of 
this recurrence relation, given in Example 3 of Section 7.3. In general, g(x) will have 
a denominator 1 + cqx + C 2 X 2 + • ■ ■ + c r x r if and only if x r + c r ^ l x r ~ 1 + C 2 x'~ 2 + 
■ ■ ■ + c r is the characteristic equation of the associated recurrence relation, and 
so 1 — ax is a factor of the denominator of g(x) if and only if a is a root of the 
characteristic equation. The numerator will depend on the initial conditions. 
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By the quadratic formula, we can factor 
1 - 


■* 2 = 


1-I ( l + V5)x][l-^(1-V5)^ 


For simplicity, let us define = 1(1 + V5) and an = 1(1 — V5). Then we have 

1 «i/V5 a 2 /V5 


g(x) = 


(1 — aqxXl — ci 2 x ) 1— &\X l — 012X 


(7) 


The decomposition of g(x) into two fractions in (7) is obtained by the method 
of partial fractions, the reverse process of combining two fractions into a common 
fraction. Setting y — a.\X in the first fraction on the right side in (7), we have 


«!/ 


/V5 1 \ _ «l_y^ n _ a 1 


1 — a\x -y/5 \ 1 


■y 


= 'Y] y” = 'Y] ct'[x" 

V5 


n= 0 


V5 


n= 0 


The same type of expansion exists for y — anx. Then a n , the coefficient of x n in the 
power series expansion of g(x), is 


1 


+1+1 


1 


V5 V5 


„"+l 


1 1 

(i + Vs) 

n+l 

1 1 1 

h-V5\ 

V5 ' 

2 

V5 1 

2 


n+1 


Observe that it is much easier to determine a\o by recursively computing 
a 2 , 03 , . . . up to aio using the Fibonacci relation than by setting n — 10 in the foregoing 
formula for a„ . 


Example 3: Selection without Repetition 

Let g„(x) be a family of generating functions g„(x) = a„,o + o„jx H — • + a, h kX k + 
■ ■ ■ + a n n x n satisfying the relation a„ ^ = Qn-i.k + witha„ o = a n,n = 1 (and 

= 0 ,k> n) obtained in Example 8 of Section 7.1 for the number of A+subsets 
of an n-set. Find a functional relation among the g n (x )s and solve it to obtain a formula 
for a „ Using the power series summation method, we obtain 

n n 

1 = ^ ^ &n,k.X = ^ l,kX “f” ®n— l,k— \X ) 

k= 1 k= 1 

n n— 1 

= ^2 Q n-l,kX k + X ^2 a n-\,hX h 
k=l h= 0 

= gn- lW“ l+Xg„-l(*) 


Thus 


gnW = g n -i(x) + Xg n -i(x) = (1 + x)g n -\(x) 
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The resulting recurrence g n (x ) = (1 + x)g n _ l (x) is solved just like the recurrence 
a n = ca„ _ i . The solution is 

g„(x) = (1 + x) ,! g 0 (x) = (1 + xf 

since go(^) = flo.o = 1. Now, by the binomial theorem, we have a„ j, : — C(n, k). ■ 


Optional: The rest of this section involves more complicated computations. The 
reader may skip this material. First we consider a nonlinear recurrence relation and 
solve it using generating functions. 


Example 4: Placing Parentheses 


Solve the recurrence relation a n = a\a„-i + a 2 fl n -2 + • • • + cucin-i + • • • + a„_iai 
obtained in Example 10 of Section 7.1 for the number of ways to place parenthe- 
ses when multiplying n numbers. 

Observe that if g(x) = ao + a\x -\ — • + a n x n + ■ ■ ■ , then the right-hand side of 
this equation is simply the coefficient of x n , for n > 2, in the product g(x)g(x) = 
(0 + aix + ■ — h a„x" H — -) 2 - Using the power series summation method, we have 
(recall that ao — 0 and a i = 1) 

OO OO 

g(x)~ lx = Y, a n x n = Y.(a 1 a n - l +a 2 a n -2l |-a„_iai)x" = [g(x )] 2 

n = 2 n—2 


Solving this quadratic equation in g(x) as described at the start of this section, we 
obtain g(x) — 1(1 ± s/1 — 4x). To make ao — 0 [i.e., g(0) = 0], we want the solution 
g(x) = 1 - iVl-4x. 

This g(x) requires a new type of generating function expansion called the general- 
ized binomial theorem. The power series expansion (1 + y) q = (^) + (^)y + (f)y 2 + 
f (//) y n 3 — • , where q is any real number, has a coefficient (// of y n defined as 

/V\ q(q - 1 )0? - 2) x ■ ■ ■ x [q - (7i - l)] ^ 

\n) n\ 


[The formula for this generalized coefficient (// arises from the Taylor series for 
(1 + yf; see any standard calculus text.] 

Using (8), the coefficient of x" in s/l — 4x is [we think of Vl — 4x as ( 1 + y)U 2 , 
where y = — 4x] 



i(-i)(-3) x ...x[-l(2»-3)] ( iy 

n! 

— Ixlx3x5x---x (2 n — 3) n 
n\ 

2 12 n - 2\ 
n\n — 1 / 


(9) 


The last step in (9) is obtained by multiplying certain numbers in the numerator by 
appropriately selected powers of 2 (details are left to Exercise 6). Multiplying the 
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final expression in Eq. (9) by — we obtain the coefficient of a" in 1 — 4x, 


a n 


1 n n - 2 

n\n — 1 


n > 1 ■ 


The expression \C{2n — 2, n — 1) arises in various combinatorial settings and is 
called the nth Catalan number. We note, as an aside, that while where parentheses 
are placed makes no real difference when multiplying numbers, if we were working 
with a complex product of matrices then the placement of parentheses has an impor- 
tant impact on the amount of computation required. Next let us consider generating 
functions for simultaneous recurrence relations. 

Example 5: Simultaneous Recurrence Relations 

Use generating functions to solve the set of simultaneous recurrence relations obtained 
in Example 1 1 of Section 7.1: 

a n — a n — i T b n _ \ T c n — \ , b n =3 c n — \ , c n = 3 b n — i , 

ai = b\ =ci = 1 


Let A(x), B(x), and C(x) be the generating functions for a n , b„, and c„, respectively. 
We use the power series summation method to obtain 


OO 

A(x ) — cio = ^ a n x n 

n = 1 


oo 

B(x) - bo = Y, b„x n 

n= 1 


oo 

C(x) — cq = Y: c n x" 

n= 1 


oo oo oo 

Y. a »- i x " + X! bn -' x " + 'Yh c «-i x " 

n= 1 n= 1 n= 1 

oo oo oo 

X Y, a mX m + X ^2 b mX m + X ^ C m X m 
m = 0 m= 0 m = 0 

xA(x) + xB(x) + xC(x) 

OO OO 

n=l n= 1 

x{\ — 3x) _1 — xC(x) 

OO OO 

Y / 3 n l X n — Yj b n-\X n 
n= 1 n= 1 

x( 1 — 3x) _1 — xB(x) 


It is always desirable to state initial conditions in terms of ao,bo,co- Solving 
our three recurrence relations for ao, bo, c o given a\ = b\ = c i = 1, we get 1 = b\ = 
3° — Co or co — 0. Similarly, we find that b o = 0 and ao = 1- Then our functional 
equations are 

A(x) — 1 = xA(x) + xB(x) + xC(x) 
or 

1 


1 — x 


A(x) = 


[xS(x) + xC(x)+ 1] 


(10) 
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and 


B(x)~ ^ *C(x) 

(id 

C(*)- xB(x) 

1 — 3x 

(12) 


We can solve (11) and (12) for B(.x) and C(x) simultaneously. Multiplying (12) 
by x and using this expression for xC(x) in (11), we have 


B(x) = 


1-3* 


■ *C(x) = 


B(x)( l-x 1 2 ) = 
B(x) = 


1 — 3* 

v-2 


1-3* 


■ xB(x) 


1-3* 
(1 — *)* 


(1 — * 2 )(1 — 3*) (1 +*)(1 — 3*) 1-3* 1+* 

The last step is again a partial fraction decomposition. The coefficient of x" in 
|(1 — 3*) _1 is |3" and in — |(l+x) _1 is — j(— 1)". So b„, the coefficient of *" 
in B(x), is 1[3" — (—1)"]. Equations (11) and (12) are symmetric with respect to 
B(x ) and C(*), and so C(x ) = B(x) and c n = b n = 1[3" — (— 1)"]. 

Next we solve for A(x) in (10): 


A(*) = 


1 2 * 

[xB(x) + *C(x) + 1] = B(x) - 


1 


1 — * 

[since B(x ) = C(*)] 

2 * 


1 — * 


1 — * 



(!-*)(! -3*) 1 


\ x 


1 — 3* 1 — * 


1 -* 


The coefficient of x" in 1(1 — 3*) _1 is ^3", in —1(1 — *) _1 is —1, in — 1(1 — x 2 ) -1 
is — 1, n odd, or 0, n even, and in (1 — *) _1 is 1. Collecting these terms, we get 
a n = 4(3" + 3), n even, and = 1(3" + 1), n odd. ■ 



7.5 EXERCISES 


1. Find functional equations for the generating functions whose coefficients satisfy 
the following relations: 

(a) a„ — a„-i +2, a 0 — 1 

(b) a n = 3a„-i — 2a„_2 + 2, ag — a\ = 1 
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(c) a„ — a n _ i + n(n — 1), «o = 1 

(d) a„ — 2a„-i + 2", a 0 — 1 

2. Solve the recurrence relations in Exercise 1 using generating functions. 

3. Find functional equations for the generating functions whose coefficients satisfy 
the following relations: 

n— 1 

(a) a„ = J2 a i a n-i-i (n > 1), a 0 = 1 

;= o 

n— 2 

(b) a n = a i a n-i (n > 3), = a\ = <22 = 1 

1=2 

n— 1 

(c) = X! 2* £!„_,■ (n > 2), fl 0 = fli = 1 

t=l 

4. Find a functional equation and solve it for the sequence of generating functions 
F n (x) = YH=o a n.kX k whose coefficients satisfy the following: 

(a) a n 'k — a. n 'k— i 2a n —\^—\^ a n $ — 0 
(h) a n j c — 2a n _\ ^ — 1 

5. Verify the form of particular solutions to inhomogeneous recurrence relations in 
the table in Section 7.4. 

6. Verify in (9) that 

— lx Ix3x5x---x (2 n — 3) 2 /2;i — 2 

n ! n \ n — 1 


7. Find a recurrence relation and solve it with generating functions for the number 
of ways to divide an /z-gon into triangles with noncrossing diagonals. (Hint: Use 
reasoning similar to Example 4.) 

8. Find a recurrence relation and associated generating function for the number of 
n-digit ternary sequences that have the pattern “012” occurring for the first time 
at the end of the sequence. 

9. Find a recurrence relation and associated generating function for the number of 
different binary trees with n leaves. 

10. Find a recurrence relation for a,,^, the number of ^-subsets of an n set with 
repetition. Find an equation for F n (x) — Y^k=o a n,kX k > an d solve for F„(x) and 

&n,k • 

11. Let a„'k be the probability that k successes occur in an experiment with n trials 
if each trial has probability p of success. Find a recurrence relation for a n j c . Use 
this relation to find and solve an equation for F„(x ) = X^=o a n,k* k ■ 

12. (a) Find a recurrence relation for a, u k, the number of ^-permutations of n 

elements. 
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(b) Show that F n (x) = YH=o a n,kX k satisfies the differential equation [F' n _ j(x) 
denotes the derivative of 1 (x)] 


F„(x) = {\+x)F n _ l {x) + x 2 F’ n _ x (x) 


n 


(c) Find a functional equation for G„(x) = a n ^x k / k\ 

k = 0 


13. Find and solve a system of recurrence relations allowing one to determine the 
number of »-digit quaternary sequences with an odd number of Is and an odd 
number of 2s. 

14. Find and solve simultaneous recurrence relations for determining the number of 
n-digit ternary sequences whose sum of digits is a multiple of 3. 

15. (a) Find a recurrence relation for a n , the number of ways to partition n distinct 

objects among n indistinguishable boxes (some boxes may be empty). 

(b) Let g(x) = Y^=o a nX n />i\ where « (l = I . Show that g(x) satisfies the differ- 
ential equation g'(x) = g(x)e x . Solve this equation for g(x). 

16. (a) Define s nj as numbers such that (l .v„,x' = x(x — 1 )(x — 2) ■ ■ ■ 

(x — n + 1). Find a recurrence relation for s nj . 

(b) Find a differential equation for F n {x) = ]L” =(| s nj x r jr\ 



7.6 SUMMARY AND REFERENCES 



In this chapter we saw that recurrence relations are one of the simplest ways to solve 
counting problems. Without fully understanding the combinatorial process, as was 
required in Chapter 5, we now need only express a given problem for n objects in 
terms of the problem posed for fewer numbers of objects. Once a recurrence relation 
has been found, then starting with ti\ (the solution for one object), we can successively 
compute the solutions for 2, 3, ... up to any (moderate) value of n. Or we can try one 
of the techniques in the later sections of this chapter to solve the recurrence relation 
explicitly. 

The first recurrence relation in mathematical writings was the Fibonacci relation. 
In his work Liber abaci, published in 1 220, Leonardo di Pisa, known also as Fibonacci, 
posed a counting problem about the growth of a rabbit population (Exercise 7 in 
Section 7.1). The number a„ of rabbits after n months was shown to satisfy the 
Fibonacci relation a n — a n _ i + «„_ 2 . As mentioned in Section 6.6, DeMoivre gave 
the first solution of this relation 500 years later in 1730 using the generation function 
derivation given in Example 2 of Section 7.5. The Fibonacci relation and numbers 
have proven to be amazingly ubiquitous. For example, it has been shown that the 
ratios of Fibonacci numbers provide an optimal way (in a certain sense) to divide 
up an interval when searching for the minimum of a function in this interval (see 
Kiefer [3]). The appearance of Fibonacci numbers in rings of leaves around flowers 
is discussed in Adler [1]. 
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The methods for solving recurrence relations appeared originally in the devel- 
opment of the theory of difference equations, cousins of differential equations. For 
a good presentation of the methods and applications of difference equations, see 
Sandefur [4]. 
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CHAPTER 8 

INCLUSION-EXCLUSION 



COUNTING WITH VENN DIAGRAMS 


In this chapter we develop a set-theoretic formula for counting problems involving 
several interacting properties in which either all properties must hold or none must 
hold. An example is counting all five-card hands with at least one card in each 
suit, or equivalently, all five-card hands with no void in any suit. In the process of 
solving such problems, we have to count the subsets of outcomes in which various 
combinations of the properties hold. We use Venn diagrams to depict these different 
combinations. See Appendix A.l for a review of the essentials of sets and Venn 
diagrams. 

Let us begin with a one-property Venn diagram, and then progress to two- and 
three-property problems. In Figure 8.1 we show a set A within a universe d(. The 
complementary set A consists of all elements of b lt not in A. Let N(S) denote the 
number of elements in set S. We define N = V(Tt). Then N (A) — N — N(A), or 
N(A) — N — N(A). This is similar to the situation in probability where the probability 
of an event £ is 1 minus the probability of the complementary event E. 

Suppose, for example, that there is a “universe” of 100 students in a math course 
and there are 30 students who are not math majors in the course — that is, N(A) = 30, 
where A is the set of math majors. Then the number N(A) of math majors in the course 
is N(A) = N- N(A) = 100 - 30 = 70. 

Next consider a problem with two sets. Let the universe s li be all students in a 
school, let F be the set of students taking French, and let L be the set of students taking 
Latin. See Figure 8.2. We want formulas for the number of students taking French 
or Latin N(F U L) and the number taking neither language N(F fl L) in terms of N, 
N(F), N(L), and N(F fl L). Note that N(F U L ) is not simply N(F) + N(L), because 
N{F) + N(L) counts each student taking both languages two times. Thus, we must 
know how many students take both languages, the number N(F fl F). Subtracting 
N(F fl L) from N ( F) + N(L) corrects the double counting of students taking two 
languages. That is, 


V(£UL) = N(F) + N(L)~ N(FnL) (1) 


By de Morgan’s law (Equation BA3 of Appendix A.l), F fl L — F U L, and so 
N(F nl) = N{FUF) = N- IV(FUL) 
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Combining this equation with (1), we have 

N(F CL) = N- N(F U L) = N - N(F) - N(L) + N(F n L) (2) 

Formula (2) is the 2-set version of the general n-set inclusion-exclusion formula 
that we present in the next section. It is called an inclusion-exclusion formula because 
first we include the whole set, N; then we exclude (subtract) the single sets F and L ; 
and then we include (add) the 2-set intersection F CL. With more sets, this alternating 
inclusion and exclusion process will continue several rounds. 


Example 1: Students Taking Neither Language 

If a school has 100 students with 50 students taking French. 40 students taking Latin, 
and 20 students taking both languages, how many students take no language? 

In this problem, N — 100, N(F) — 50, N(L ) = 40, and N(F fl L) — 20. We need 
to determine N(F fl L). By Eq. (2), we have 

N(F CL) = N - N(F) - N(L) + N(F C L) = 100-50-40+20 = 30 ■ 

The next example applies this set-theoretic formula to a counting problem that 
has no obvious set-theoretic structure. 


Example 2: Restricted Arrangements 

How many arrangements of the digits 0, 1 , 2, . . . , 9 are there in which the first digit 
is greater than 1 and the last digit is less than 8? 

Before using formula (2) to solve this problem, let us consider a more direct 
approach and see why it fails. For the first digit, there are eight choices. Then for the 
last digit, there are . . . the number of choices depends on whether or not an 8 or 9 
was chosen for the first digit. If an 8 or 9 were chosen for the first digit, there will be 
eight choices for the last digit, while if neither 8 nor 9 were chosen for the first digit. 
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there will be seven choices for the last digit. Because of this difficulty, we shall solve 
the problem with formula (2). 

Let °li be all arrangements of 0, 1, 2, . . . , 9. Let F be the set of all arrangements 
with a 0 or 1 in the first digit, and let L be the set of all arrangements with an 8 or 9 
in the last digit. Then the number of arrangements with first digit greater than 1 and 
the last digit less than 8 is N(F fl L). 

We have N = 10!, N(F) = 2x9! (two choices for the first digit followed by any 
arrangement for the remaining 9 digits), and N(L) = 2x9! Similarly, N(F fl L) = 
2x2x8!. Then by (2), 

]V(FnI)= 10! — (2x9!) — (2x9!) + (2 x2x 8!) ■ 

The strategy used to solve Example 2 and most other examples in this chapter is 
as follows: when faced with multiple constraints that are difficult to enumerate, we 
try to solve the problem by counting the sizes of the complementary sets for these 
constraints, as well as the intersections of these complementary sets. The inclusion- 
exclusion formula tells us how to put together the answers to the complementary-set 
calculations solve the original problem. 

There is one important notational convention to watch out for. The sets in the 
inclusion-exclusion formula are always defined so that the final answer (the left side 
of (2)) is the number of items that are in none of the sets. Thus, we need to define sets 
that represent the complements of the original constraints we are given. For example, 
in Example 2, the set of arrangements where the first digit is not greater than 1 — the 
complement of the given constraint on the first digit — is defined to be F, so that the 
original constraint of the first digit being greater than 1 is now F . 

Consider next a problem with three sets. We extend Figure 8.2 with the additional 
set G of students taking German, as shown in Figure 8.3. We want a formula for 
N(F nifl G). A first guess might be 

N(F n L n G) = N - N(F) - N(L) - N(G) 

As in Figure 8.2, this formula double-counts (that is, subtracts twice) the students 
in two of the sets, F, L, and G. We can correct this first formula by adding the number 
of students taking two languages. Thus we propose the formula 

N(F D L n G) = N — N(F) - N(L) - N(G) 

+N{F ni)+ N(L n G) + N(F n G) (3) 



Figure 8.3 
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Number of 
Languages 
Taken by 
Student 

N 

~[N(F) + N(L) 

+ N(G)] 

+ [N(F fl L) 

+ N(L n G) 

+ N(F n G)] 

—N(F nine) 

0 

+ 1 

0 

0 

0 

1 

+ 1 

-1 

0 

0 

2 

+ 1 

-2 

+1 

0 

3 

+ 1 

-3 

+3 

-1 


Figure 8.4 

Figure 8.4 shows how many times a student will be added and subtracted in parts 
of this formula. A student taking no language is counted once (by the term N) — such 
students are exactly the ones we want to count. The challenge is to make sure that 
all other students are counted a net of 0 times. The students taking one language 
are counted once by N and subtracted once by the term ~[N(F) + N(L) + N(G)], 
for a net count of 0. The students taking two languages are counted once by N, 
subtracted twice by —[N(F) + N(L) + N(G)] (since they are in exactly two of the 
three sets), and then added once by the term +[N(F fl L) + N(L fl G) + N(F fl G)] 
(since they are in exactly one of the three pairwise intersections), for a net count of 0. 
Finally, we consider the students taking all three languages. They are counted once 
by N, then subtracted three times by the sum of the three sets (since they are in all 
three sets), then added three times by the pairwise intersections (since they are in all 
three of these subsets). This yields a net count of 1— 3 + 3=1. Then we must correct 
formula (3) by subtracting N(F fl L fl G) to make the net count of students with all 
three languages 0: 

N{F n L n G) = N - [N(F) + N(L) + N(G)] 

+ [N(F n L) + N(L n G) + N(F n G)] - N(F n L n G) (4) 

For general sets A i, At, A 3 , we rewrite (4) as 

N(Ai n A 2 n A 3 ) = N — N(Ai) + J2 N(Ai Fl Aj)~ N(A 1 n A 2 n A 3 ) (5) 

i ‘j 

where the sums are understood to run over all possible i and all i,j pairs, respectively. 

Example 3: Students Taking None of Three Languages 

If a school has 100 students with 40 taking French, 40 taking Latin, and 40 taking 
German, 20 students are taking any given pair of languages, and 10 students are taking 
all three languages, then how many students are taking no language? 

Here N = 100, N(F) = N(L)= N(G) = 40, N(F fl L) = N(L n G) = 
A(F fl G) = 20, and N(F fl L fl G) = 10. Then by (4), the number of students 
taking no language is N(F fl L fl G) = 100 — (40 + 40 + 40) + (20 + 20 + 20) — 
10 = 30. ■ 
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Next we apply (5) to two counting problems that cannot be solved by methods 
developed in the three previous chapters. 

Example 4: Relatively Prime Numbers 

How many positive integers <70 are relatively prime to 70? 

Let fc ll be the set of integers between 1 and 70. The phrase “relatively prime to 
70” means “have no common divisors with 70.” The prime divisors of 70 are 2, 5, 
and 7. Then we want to count the number of integers <70 that do not have 2 or 5 
or 7 as divisors. Let A\ be the set of integers in 0 ! that are evenly divisible by 2, or 
equivalently, integers inTl that are multiples of 2; A 2 be integers evenly divisible by 5; 
and A 3 be integers evenly divisible by 7. Then the number of positive integers <70 
that are relatively prime to 70 equals AT A 1 n A2 n A3). We find 

N = 10 N(A\) = 70/2 = 35 N(A 2 ) = 70/5 = 14 N(A 3 ) = 70/7 = 10 

The integers evenly divisible by both 2 and 5 are simply the integers evenly divis- 
ible by 10. Thus N(Ai fl A2) = 70/10 = 7. By similar reasoning, N(A 2 fl A3) = 70/ 
(5x7) = 2, N(A[ n A 3 ) = 70/(2 x 7) = 5, and N(A { DA 2 D A 3 ) = 70/(2 x 5 x 7) = 1. 
So by Eq. (5): 

AT A 1 nA 2 nA 3 ) = 70-(35+ 14+ 10) + (7 + 2 + 5)- 1 = 24 ■ 

Example 5: Ternary Sequences with No Voids 

How many 72 -digit ternary (0, 1, 2) sequences are there with at least one 0, at least 
one 1, and at least one 2? How many 72 -digit ternary sequences with at least one void 
(missing digit)? 

Let At be the set of all 77 -digit ternary sequences. Formula (5) counts outcomes 
in which none of a set of properties holds. So we must formulate the first part of this 
problem in terms of outcomes for which none of a set of properties holds. The solution 
is to define A, to be the number of 72 -digit ternary sequences with no is, for / = 0, 1,2. 
(Note that instead of numbering the sets A 1; A 2 , A3, we are using Ao, Ai, A 2 .) Then 
the number of sequences with at least one of each digit will be N(Aq fl Ai fl A 2 ). 

The number of 72 -digit ternary sequences is N = 3" . The number of 72-digit ternary 
sequences with no 0s is simply the number of 72-digit sequences of Is and 2s. Thus, 
N(Aq) = 2'\ Similarly, N{A\) = N(A 2 ) = 2". The only 72-digit sequence with no 
0s and no Is is the sequence of all 2s. Then N(Aq n Ai) = 1; also N(A \ fl A 2 ) = 
N(Aq (T A 2 ) = 1. Finally, there is no ternary sequence with no 0s and no Is and no 
2s. Then by (5), 

N(A 0 n Ai n A 2 ) = 3" - (2" + r + 2") + (1 + 1 + 1) - 0 
= 3" - 3 X 2" + 3 

Now we turn to the second part of this problem involving 72 -digit ternary se- 
quences with at least one void. The phrase “at least” is used in a very different way 
here than it was used in the first part. At least one void means a void of the digit 0 or 
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a void of the digit 1 or a void of digit 2. In terms of the A, defined above, we want to 
count the union of the A,’s — namely, N(Ai U A 2 U A3). Thus, the sequences with at 
least one void are exactly the complement of the sequences with no voids that were 
counted in the first part. So the answer to the second part is 

N(A[ UA 2 U A3) — N — N(Ai n A 2 n A 3 ) = 3" - (3" - 3 x 2" + 3) 

= 3 x 2" - 3 ■ 

The reader must be constantly alert for union problems when seeing the phrase 
“at least” in inclusion-exclusion problems. In one case (the more common case), we 
are counting outcomes with no voids of any type — an intersection problem — which 
means at least one outcome of the first type and at least one outcome of the second 
type and etc. In the other case, we are counting outcomes with at least one of a set of 
properties — a union problem — which means an outcome with the first property or an 
outcome with the second property or etc. A general formula for union problems will 
be presented in the next section. 

Observe that whereas polynomial algebra was used in Chapter 6 to model count- 
ing problems and recurrence relations were used in Chapter 7, now we are using a set- 
theoretic model. This approach does not eliminate combinatorial enumeration as the 
other models did. We still must solve the subproblems of finding V(A,), Ni A, fl A ; ), 
and so forth, but these are much easier problems. 

Sometimes non-standard input data for a Venn diagram are given. The following 
example shows how to determine the sizes of all possible subsets in a Venn diagram 
in such circumstances. 

Example 6: Nonstandard Constraints 

Suppose there are 100 students in a school and there are 40 students taking each 
language, French, Latin, and German. Twenty students are taking only French, 20 
only Latin, and 15 only German. In addition, 10 students are taking French and Latin. 
How many students are taking all three languages? No language? 

We draw the Venn diagram for this problem and number each region as shown 
in Figure 8.5. Let N, denote the number of students in region i, for /' = 1 , 2, .... 8. 
Students taking only French are the subset F f! L fl G, region 1 ; so N\ — 20. Similarly, 
the other information given us says Ns = 20 and N-/ — 15. Students taking both French 
and Latin are the subset FnL = (FninG)U(FnLn G), regions 2 and 3. So 



Figure 8.5 
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N 2 + N 3 = 10. The set of students taking French is F, which consists of regions 1, 2, 
3, and 4. So 

40 = N(F) = N\ + (N 2 + N$) + IV4 = 20 + (10) + N 4 , or N 4 = 10 

Similarly, set L consists of regions 2, 3, 5, and 6, and so 40 = N(L) = (N 2 + N 3 ) + 
N 5 + N(, = (10) + 20 + /Vg, implying N( } — 10. 

Since G consists of regions 3, 4, 6, and 7, and since we were given /V 7 = 15 and 
have just found that /V 4 = Aj-, = 10, then 

40 = N(G) = Al 3 + AI4 + + 10 + 10 + 15 or V 3 = 5 

But region 3 is the subset F fl L fl G of students taking all three languages. Thus, 
there are five trilingual students. 

The general line of attack in these problems is to break the Venn diagram into 
the eight regions shown in Figure 8.5. For each subset whose size is given, write that 
number as a sum of V, ’s of regions i in that subset. By combining these equations (and 
sometimes solving them simultaneously), one can eventually determine the number 
of elements in each region. Then the size of any subset is readily found as the sum 
of the sizes of the regions in that subset. For example, in the preceding paragraph 
we determined all V, s except Ni and Ng. But /Vo + /V3 = 10 and N 3 was found to be 
5; thus Ni = 5. Ng — N(F fl L fl G) is the number of students taking no language. 
Since all regions total to N, then 

7 

N s = N-J2 N ‘ = 100- (20 + 5 + 5+ 10 + 20+ 10+ 15) 

;= 1 

= 100-85 = 15 ■ 

Note that because there are eight underlying variables (the sizes of the eight basic 
regions), we need to be given eight pieces of information to be able to solve such a 
problem. 
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Summary of Exercises The first 30 exercises are similar to the 
examples in this section. The last six exercises involve more complicated Venn 
diagram arguments; see the last paragraph in Example 6 for the strategy for solving 
these problems. 


1. How many 8-letter “words” using the 26-letter alphabet (letters can be repeated) 
either begin or end with a vowel? 

2. How many 9-digit Social Security numbers are there with repeated digits? 

3. How many n-digit ternary sequences are there in which at least one pair of 
consecutive digits are the same? 
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4 . What is the probability that at least two heads (not necessarily consecutive) will 
appear when a coin is flipped eight times? 

5 . What is the probability that a 7-card hand has at least one pair (possibly two 
pairs, three of a kind, full house, or four of a kind)? 

6. If n people of different heights are lined up in a queue, what is the probability 
that at least one person is just behind a taller person? 

7 . How many 5-digit numbers (including leading Os) are there with exactly one 8 
and no digit appearing exactly three times? 

8. Suppose 40% of all families own a dishwasher, 30% own a trash compacter, 
and 20% own both. What percentage of all families own neither of these two 
appliances? 

9. Among 700 families, 1 50 families have no children, 1 80 have only boys, and 200 
have only girls. How many families have boy(s) and girl(s)? 

10 . How many ways are there to pick five people for a committee if there are six 
(different) men and eight (different) women and the selection must include at 
least one man and one woman? 

11 . Suppose a bookcase has 300 books, 70 in French and 100 about mathematics. 
How many non-French books not about mathematics are there if 

(a) There are 40 French mathematics books? 

(b) There are 60 French nonmathematics books? 

12 . How many arrangements of the 26 different letters are there that 

(a) Contain either the sequence “the” or the sequence “aid”? 

(b) Contain neither the sequence “the” nor the sequence “math”? 

13 . If you pick an integer between 1 and 1 ,000, what is the probability that it is either 
divisible by 7 or even (or both)? 

14 . How many arrangements of the letters in INVITING are there in which the three 
Is are consecutive or the two Ns are consecutive (or both)? 

15 . A school has 200 students with 85 students taking each of the three subjects: 
trigonometry, probability, and basket- weaving. There are 30 students taking any 
given pair of these subjects, and 15 students taking all three subjects. 

(a) How many students are taking none of these three subjects? 

(b) How many students are taking only probability? 

16 . Suppose 60% of all college professors like tennis, 65% like bridge, and 50% like 
chess; 45% like any given pair of recreations. 

(a) Should you be suspicious if told 20% like all three recreations? 

(b) What is the smallest percentage who could like all three recreations? 

17 . How many numbers between 1 and 30 are relatively prime to 30? 

18 . How many numbers between 1 and 280 are relatively prime to 280? 
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19. How many ways are there to assign 20 different people to three different rooms 
with at least one person in each room? 

20 . How many «-digit numbers are there with at least one of the digits 1 or 2 or 3 
absent? [Hint: This is a union problem; find NiA \ U ,4 2 U A 3 ).] 

21. How many arrangements are there of MURMUR with no pair of consecutive 
letters the same? 

22 . If three couples are seated around a circular table, what is the probability that no 
wife and husband are beside one another? 

23 . How many ways are there to form a committee of 10 mathematical scientists from 
a group of 15 mathematicians, 12 statisticians, and 10 operations researchers with 
at least one person of each different profession on the committee? 

24 . Find the number of ways to arrange the six numbers 1, 2, 3, 4, 5, 6 such that either 
in the arrangement 1 is immediately followed by 2, or 3 is immediately followed 
by 4, or 5 is immediately followed by 6 . [Hint: This is a union problem.] 

25 . How many ways are there to deal a 6 -card hand that contains at least one Jack, 
at least one 8 , and at least one 2 ? 

26 . How many ways are there to arrange the letters in the word MISSISSIPPI so that 
either all the Is are consecutive or all the Ss are consecutive or all the Ps are 
consecutive? 

27. How many arrangements are there of TAMELY with either T before A, or A 
before M, or M before E? By “before,” we mean anywhere before, not just 
immediately before. [Hint: This is a union problem.] 

28 . How many arrangements are there of MATHEMATICS with both Ts before both 
As, or both As before both Ms, or both Ms before the E? Note that “before” is 
used as in Exercise 27. 

29. The Bernsteins, Hendersons, and Smiths each have five children. If the 1 5 children 
of these three families camp out in five different tents, where each tent holds three 
children, and the 15 children are randomly assigned to the five tents, what is the 
probability that every family has at least two of its children in the same tent? 

30 . Suppose 45% of all newspaper readers like wine, 60% like orange juice, and 55% 
like tea. Suppose 35% like any given pair of these beverages and 25% like all 
three beverages. 

(a) What percentage of the readers likes only wine? 

(b) What percentage of the readers likes exactly two of these three beverages? 

31. Suppose that among 40 toy robots, 28 have a broken wheel or are rusted but 
not both, six are not defective, and the number with a broken wheel equals the 
number with rust. How many robots are rusted? 

32 . Suppose a school with 11 students offers two languages, PASCAL and BASIC. If 
30 students take no language, 70 students do not take just PASCAL (i.e., either 
they do not take PASCAL or they take both languages), 80 students do not take 
just BASIC, and 20 students take both languages, determine n. 
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33 . Suppose a school with 120 students offers yoga and karate. If the number of 
students taking yoga alone is twice the number taking karate (possibly, karate 
and yoga), if 25 more students study neither skill than study both skills, and if 
75 students take at least one skill, then how many students study yoga? 

34 . A survey of 150 college students reveals that 83 own cars, 97 own bikes, 28 own 
motorcycles, 53 own a car and a bike, 14 own a car and a motorcycle, seven own 
a bike and a motorcycle, and two own all three. 

(a) How many students own just a bike? 

(b) How many students own a car and a motorcycle but not a bike? 

35 . Suppose that among 150 people on a picnic, 90 bring salads or sandwiches, 80 
bring sandwiches or cheese, 100 bring salads or cheese, 50 bring cheese and 
either salad or sandwiches (possibly both), 60 bring at least two foods, and 20 
bring all three foods. How many people bring just salads? How many people 
bring just sandwiches? 

36 . In a class of 30 children, 20 take Latin, 14 take Greek, and 10 take Hebrew. If no 
child takes all three languages and eight children take no language, how many 
children take Greek and Hebrew? [Hint: Use formula (5) to determine the value 
of the expression N(L n G) + N(L fl H) + N(G fl H).] 



8.2 INCLUSION-EXCLUSION FORMULA 

In this section we generalize the inclusion-exclusion formula for counting N(A\ fl 
A 2 fl A3 ) to n sets A ! , A2 , . . . , A„ . To simplify notation, we will omit the intersection 
symbol “fl” in expressions and write intersected sets as a product. For example, 
A 1 fl A2 fl A3 would be written A1A2A3. Using this new notation, the number of 
elements in none of the sets Ai, A2 , . . . , A„ will be written N(A\ A2 • ■ ■ A„). Recall 
that an inclusion-exclusion formula is so called because of the way it successively 
includes (adds) and excludes (subtracts) the various k-tuple intersections of sets. 

Theorem 1 Inclusion-Exclusion Formula 

Let A] , A2, • • • A„, be n sets in a universe 9 1 of N elements. Let Sk denote the sum of 
the sizes of all /.'-tuple intersections of the A, s. Then 

N(A X M • • • An) = N - S 1 + S 2 - S 3 + • • • + ( — 1)*S* + • • • + (-1)'% (1) 

Proof 

To clarify the definition of the Sk s, Si = JL /V(A;), S 2 — ^L ; /V(A, Aj), Sk is the sum 
of the N{Aj i Aj 1 ■ ■ ■ Aj k ) s for all sets of k Af s, and finally S n = N(A\ A2 • • • A„). We 
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prove this formula by the same method used for N(F L G ) in the previous section: 
we shall show that the net effect of (1) is to count any element in none of the sets Aj 
once and to count elements in one or more Aj ’s a net of 0 times. 

If an element is in none of the Aj ’s — that is , is in A i A 2 ■ ■ ■ A „ , then it is counted 
once in the right-hand side of (1) by the term N and is not counted in any of the 5* s. 
So the count is 1 for each element in A\ A 2 ■ ■ ■ A n , as required. An element in exactly 
one Aj is counted once by N, is subtracted once by Si (since it is in one of the A/s), 
and is counted in none of the other S* s — for a count of 1 — 1 = 0, as required. Now 
more generally let us show that an element x that is in exactly m of the A/s has 
a net count of 0 in (1). Element x is counted once by N, is counted m times by Si 
(since x is in m A/s), is counted C(m, 2) times by S2 [since x is in the intersection 
AjAj for the C(m, 2) pairwise intersections involving two of the m sets containing 
x ], . . . , and, in general, is counted C(w, k) times by S/ 0 k < m. It is not counted in 
S/, when h > m . So the net count of x in (1) is 

>-(,)+( 2 )-( 3 ) + ^ +( “ 1) (J + " +< - |) (,„) <2) 

This alternating sum of binomial coefficients can be evaluated from the binomial 
expansion 

(i+,r = !+("/+(”/;+. ■■+(“/* <3) 

+■■■+(>" 

If we set x = — 1 in (3), the right side of (3) is now the expression in (2), whereas the 
left side of (3) becomes [1 + (— l)] m = O'" = 0. Thus, (2) equals 0, as required. ♦ 

Corollary 

Let Ai, A2 ■ • ■ A„ be sets in the universe 91. Then 

N(A\ U A2 U • ■ ■ U A„) = Si — S 2 + S3 

_... + (_l/- 1 S i + ... + (-l)”- 1 S„ (4) 

Proof 

We write formula ( 1 ) as 

N(AxM ■ • • A„) = N - [Si - S 2 + S 3 - ■ • • + (-ly'S/ (5) 

Next we observe that the number of elements in none of the sets equals the total 
number of elements minus the number of elements in one or more sets — that is, 

N (Ai A 2 ■ • • A„) = N — N(Ai U A 2 U ■ • • U A„) (6) 

Comparing (5) and (6), we see that the expression in brackets on the right side of (5) 
is A(AiUA 2 U---UA„). ♦ 
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Before giving examples of the inclusion-exclusion formula, we want to empha- 
size an important logical point about applying this formula. To use this formula in 
a counting problem, one must select a universe ‘’ll and a collection of sets A, in that 
universe such that the outcomes to be counted are the subset of elements in 6 U that are 
in none of the A,s. That is, the A,s represent properties not satisfied by the outcomes 
being counted. 


Example 1: Card Hands with No Suit Voids 

How many ways are there to select a 6-card hand from a regular 52-card deck such 
that the hand contains at least one card in each suit? How many 6-card hands with a 
void in at least one suit? 

The universe At should be the set of all 6-card hands. We need to define the sets 
A, such that hands with at least one card in each suit are in none of the A, s. With a 
moment’s thought, we see that at least one card in a suit is equivalent to no void in 
the suit. Thus, we let A i be the set of 6-card hands with a void in spades; Ai a void 
in hearts; A3 a void in diamonds; and A 4 a void in clubs. Now the first part of the 
problem asks for N(A\ A2A3A4), and we can use the inclusion-exclusion formula. 

We must next calculate N, Si, S2, S3, and S4. As noted in Chapter 5, a 6-card 
hand is simply a subset of six cards, and so N — C(52, 6). The size of A 1 , the set 
of hands with a void in spades, is simply the number of 6-card hands chosen from 
the 52 — 13 = 39 nonspade cards. So N{A\) — C( 39, 6). Likewise, V(A,) = (39, 6), 
i = 2, 3, 4, and so Si = 4 x C(39, 6). The hands in Ai A2 are hands chosen from the 
26 non-(spades or hearts) cards, and so N(AiAt) = C( 26, 6). There are C(4, 2) = 6 
different intersections of two out of the four sets, and so S2 = 6 x C(26, 6). There 
are C(4, 3) = 4 different triple intersections of the sets and each has C(13, 6) hands. 
So S3 = 4 x C(13, 6). Finally, a hand cannot be void in all four suits, and so S4 = 0. 
Then by (1), 


N (Ai A2A3A4) 




+ 0 


The second part of the problem, counting 6-card hands with a void in at least one 
suit, asks for N(Ai U A2 U A3 U A4), which by formula (4) in the Corollary, equals 
Si — S2 + S3 — S4. So the answer for the second part is 



We note that, in general, S/. is a sum of C(n, k ) different k-tuple intersections of 
the n Ai ’s. 


Example 2: Distributions with an Empty Box 

How many ways are there to distribute r distinct objects into five (distinct) boxes with 
at least one empty box? 
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We do not need to determine N(A\ A 2 A 3 A 4 A 5 ) in this problem, because it does 
not concern outcomes where some property does not hold for all boxes. Rather, this 
is a union problem, using the corollary’s formula. 

Let ‘’ll be all distributions of r distinct objects into five boxes. Let A; be the set 
of distributions with a void in box i. Then the required number of distributions with 
at least one void is N(A\ U A 2 U • ■ ■ A 5 ). We have N — 5 '\ (V(A,) — 4 r (distributions 
with each object going into one of the other four boxes), N(A,Aj) = 3 r , and so forth. 
As just noted, there are C(5, k) subsets in Sf,, k = 1,2, . . . , 5 . Thus by ( 4 ), 


N(A 1 U A2 U • • ■ A5) = Si — S2 + S3 — S4 + S5 



It is easy to mistake union problems, which use phrases such as “with at least one 
empty box,” with standard inclusion-exclusion problems, which use phrases such as 
“at least one object in every box.” The former ask for at least one of a set of properties 
to hold, whereas the latter ask for every property to hold. Moreover, the latter problems 
must be solved by using the complementary properties, such as “box i is empty,” and 
determining all ways for none of these complementary properties to hold. The reader 
has to reason carefully through a counting problem and determine whether to frame 
the solution as counting outcomes where none of a set of properties hold or outcomes 
where one or more properties hold. 

Another common source of confusion is the subscripts of the Si s and the sub- 
scripts of the Aj’s. In example 1, students sometimes define Ai to be all hands with a 
void in one suit, A 2 to be all hands with a void in two suits, and so on. This is wrong. 
The subscript of the As is an ordinal (ordering) number. A\ denotes the first set 
(in Example 1, the set of all hands with a void in spades), A 2 the second set, A 3 the 
third set, .... The subscript of the Ss is a cardinal (magnitude) number. Si is the sum 
of the sizes of all single sets, S 2 the sum of the sizes of all pairwise intersections of 
sets, S 3 the sum of the sizes of all 3-way intersections of sets, .... 


Example 3: Upper Bounds on Integer Solutions 

How many different integer solutions are there to the equation 

xi + X 2 + X 3 + X 4 + xs + xc, — 20 0 < Xi < 8 

This type of problem was solved with generating functions in Section 6.2. Now 
we solve it with an inclusion-exclusion approach. Let S U be all integer solutions with 
Xi > 0, and let A, be the set of integer solutions with x, > 8 , or equivalently x, > 9. 
Then the number of solutions with 0 < x, < 8 will be N(Ai A2 • • • AL). 

Recalling the formulas for integer solutions of equations from Section 5.5 
(Example 5) we have 

/ 20 + 6 — 1 \ _ / 25 

V 20 ) ~ V20 


N = 
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To count N(A\), the outcomes with X\ > 9, we consider the x,’s to be amounts 
of objects chosen when a total of 20 objects are chosen from six types. The outcomes 
with at least nine of the first type can be generated by first picking nine of the first 
type and then picking the remaining 20 — 9 objects without restriction from the six 
types. This reasoning applies to all N(Aj ): 


N(Ai) 


( (20 — 9) + 6 — 1 
V (20-9) 


16 

11 


By a similar reasoning, now first choosing nine from two types and then choosing 
the remaining 20 — 9 — 9 objects without restriction, we have 


N(A I A/) = 


(20 — 9 — 9) + 6— 1 
(20-9-9) 


For a solution to be in three or more A, ’s, the sum of the respective x, ’s would exceed 
20 — impossible. So Sj = 0 for j > 3, and 


N(A l A 2 ---A 6 ) = N-S l + S 2 = (20) — (l) (ll) 



Recall that to use generating functions to solve the preceding problem, we would 
seek the coefficient of x 20 in 


( 1 + x + • • • + x s ) 6 = [(1 - x 9 )/(l - x)] 6 = (1 - x 9 ) 6 ( 1 - x)- 6 

The coefficient of x 20 in this product is aob 2 o + agbn + a^b 2 , where a* is the coeffi- 
cient of x k in (1 — x) -6 and is the coefficient of x k in (1 — x 9 ) 6 . This coefficient of 
x 20 turns out to be exactly the foregoing expression for N{A\A 2 ■ • ■ A( t ). The factor 
(1 — x 9 ) 6 = 1 — C(6, l)x 9 + C( 6, 2)x 18 . . . does the inclusion-exclusion task of sub- 
tracting cases where one x ; is at least 9 and adding back cases where two x, ’s are 
at least 9. By using generating functions to solve this problem, we did not need to 
know anything about the inclusion-exclusion complexities of this problem. Generat- 
ing functions automatically performed the required inclusion-exclusion calculations! 


Example 4 : Retrieving Hats 

What is the probability that if n people randomly reach into a dark closet to retrieve 
their hats, no person will pick his own hat? 

The probability will be the fraction of outcomes in which no person gets her own 
hat. Our universe Tl will be all ways for the n people to successively select a different 
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hat. So N — n ! If A, is the set of outcomes in which person i gets her own hat, then 
N(A\A 2 ■ ■ ■ A„) counts the required outcomes where no one gets her own hat. 

Then N(Ai ) = (n — 1)!, since given that person i gets her hat, the number of 
possible outcomes is all ways for the other n — 1 people to select hats. Similarly 
N{AjAj ) = (n — 2)!, and generally N{Aj i Aj 1 ■ ■ ■ Aj t ) = (n — k)\ for k-way intersec- 
tions. Since Su is a sum of C(n, k) A:-way terms, we obtain by Eq. (1) 

N(A l A 2 ---A n ) = N-S l +S2 + --- + (-l) k S k + ---+(-irS n 

= «!-(”)(« — 1)! + (”)(« — 2)! + .-. 

+(-l)*(”)(n — £)! + -. + (-If Qo! 
k = o K 


Recalling that ^ j 


k\(n — k)\ 


, we see that 


(>-* )! 


n ! 

= — So 
k\ 


- ^(-1/n! 1 )* 

Af(A 1 A 2 ...A„) = X; L ^— 


k = 0 


k = 0 


A:! 


and now the probability that no person gets her own hat is 


N(A l A 2 ---A„) .^(-D* / . . . , 1 1 

= n ! > — ~ — / «! = 1 — 1 + — — — 


(-D" 




jt=0 


/c! 


2! 3! 


(7) 


This alternating series is the first n + 1 terms of 1)V^> which is the power 

series for e x when x = — 1. The series converges very fast. The difference between 
e~ l and (7) is always less than l/«! For example, = 0.367879 . . . and for n — 8, 
the series in (7) equals 0.367888 (even for n — 5, it is 0.366). Thus, for all but very 
small n, the desired probability is essentially e~ l . The answer is independent of n. ■ 


The problem treated in Example 4 is equivalent to asking for all permutations 
of the sequence 1,2 , ,n such that no number is left fixed, that is, no number i is 
still in the fth position. Such rearrangements of a sequence are called derangements. 
The symbol D„ is used to denote the number of derangements of n integers. From 
Example 4, we have 


D„ = n\ y. 


(-D* 


: n\e 


-l 


k = 0 


k\ 
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For exact values of D„, it is usually easier to use the following simple recurrence 
relation: 

D, , — n D n _\ + (- 1 )” n > 2 
See Exercise 36 for details on deriving this recurrence. 


Example 5: Graph Coloring 

How many ways are there to color the four vertices in the graph shown in Figure 8.6 
with n colors (such that vertices with a common edge must be different colors)? 

We label the edges e\, e 2 , e 3, e 4 , e$, as shown in Figure 8.6. The universe should 
be the set of all ways to color the vertices. So N = n 4 ( n choices for each vertex). The 
situation we must avoid is coloring adjacent vertices with the same color. For each 
edge e,- , we define the set A, to be all colorings of the graph in which the vertices at 
either end of edge e ,• have the same color. Then N(A\ A2A3A4A5) will be the desired 
number of proper colorings with n colors. 

Then N(Aj ) — n 3 , since the two vertices connected by e, get one color and the 
two other vertices each get any color, and S 1 = 5 x n 3 . Similarly, N( A; A ; ) = n 2 and 
S2 = C( 5 , 2 ) x n 2 . A little care must be used with three-way intersections. Specifi- 
cally, edges ei, e 2 > e? interconnect three, not four, vertices, and so N(A\ A2A3) = n 2 
(one color for vertices xj, X2,X3 and one color for vertex X4). Fikewise, /VM3 A4A3) = 
n 2 . Any other set of three edges interconnect to all four vertices, and so the associ- 
ated N{AiAjAk) equals n. Then S3 = 2 x n 2 + [C( 5 , 3 ) — 2 ] x n. Four or five edges 
always interconnect all four vertices, and so S 4 = C( 5 , 4 ) x n and S5 = n . Then the 
answer to our problem is 

N(Ai A 2 A 3 A 4 A 5 ) = n 4 - ^ n 3 + 

— 2n 2 + — 2^ x n +(\n — n 

— n 4 — 5 n 3 + 8 n 2 — 4 n m 

Note that the preceding expression is the chromatic polynomial of this graph (see 
Section 2 . 4 ). 

We conclude this section with two generalizations of the inclusion-exclusion 
formula. Many readers may want to skip this material. 
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Theorem 2 

If A\, A 2 , . . . , A„ are n sets in a universe <31 of N elements, then the number N,„ of 
elements in exactly m sets and the number N* of elements in at least m sets are given 
by 


N m — S m 


m + 1 


y in — ‘-'m l l ^m+1 

m 1 


Sm + 1 + 


m + 2 


+ ■ •• + (-!)” 


Sn 


"*■-*■-1 + C -\ 


+ • •• + (-!)" 


n — 1 
m — 1 


Sn 


Sm+2 +•••+(—!)' 


K-m , ~ , Sk 


( 8 ) 


Sm+2-\ f (“I)* m ( m _ j ) Sk 


(9) 


Proof 

The formula for N m can be proved in a fashion similar to the proof of the inclusion- 
exclusion formula. All elements in exactly m sets will be counted once in S m , and not 
counted in any other term in (8), as required. We must show that elements in more 
than m sets are counted a net of 0 times in (8). In this formula the count is slightly 
trickier to sum. Readers who dislike such technicalities should skip this proof. 

The count for an element in r sets, m < r < n, is C(r, k) in Sk, and so the net 
count of this element in (8) is 



Remember that the element is not counted in Sk for k > r. Recall from Example 1 of 
Section 5.5 that the number of ways to pick k objects from r and then pick m special 
objects from those k is equal to the number of ways to pick the m special objects from 
the r first and then pick k — m more from the remaining r — m. So 



Using this substitution, (10) now becomes 



336 Chapter 8 Inclusion-Exclusion 


As in the proof of Theorem 1, the expression in brackets here is just the expansion of 
(1 + x) r ~ m with x = — 1 So (10) sums to 0, as required. 

The formula for N* n can be verified with induction by showing that formula (9) 
for N* satisfies N* = N m + /V* +l : “In at least m sets” means “in exactly m sets” or 
“in at least m + 1 sets.” ♦ 

Example 6 

Find the number of 4-digit ternary sequences with exactly two Is. Also find the 
number with at least two Is. 

Letdl be all 4-digit ternary sequences. If A, is the set of 4-digit ternary sequences 
with a 1 in position then N 2 and N* are the numbers of 4-digit ternary sequences 
with exactly two Is and at least two Is, respectively. Then N = 3 4 and 5j = C( 4, 1)3 3 , 
S 2 = C( 4, 2)3 * 1 2 3 4 5 6 , S 3 = C( 4, 3)3 \ and S 4 = C( 4, 4)3° = 1. Now by formulas (8) and 
(9), the desired numbers are 



The observant reader may have noted that N 2 can be computed directly by a 
simple combinatorial argument. 



8.2 EXERCISES 


Summary of Exercises These exercises require a combination of 
inclusion-exclusion modeling and Chapter 5 type of enumeration skills (to solve the 
subproblems, some of which are tricky). Exercises 37-47 use Theorem 2. 


1. How many m-digit decimal sequences (using digits 0, 1, 2, . . . , 9) are there in 
which digits 1, 2, 3 all appear? 

2. How many ways are there to roll eight distinct dice so that all six faces appear? 

3. What is the probability that a 9-card hand has at least one 4 of a kind? 

4. How many positive integers <630 are relatively prime to 630? 

5. What is the probability that a 13-card bridge hand has 

(a) At least one card in each suit? 

(b) At least one void in a suit? 

(c) At least one of each type of face card (face cards are Aces, Kings, Queens, 
and Jacks)? 

6. Given six pairs of non-identical twins, how many ways are there for six teachers 
to each choose two children with no one getting a pair of twins? 
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7 . How many arrangements are there of a, a, a, b, b, b, c, c, c, without three 
consecutive letters the same? 

8. Given 2 n letters, two of each of n types, how many arrangements are there with 
no pair of consecutive letters the same? 

9 . How many permutations of the 26 letters are there that contain none of the 
sequences MATH, RUNS, FROM, or JOE? 

10 . How many integer solutions of x i + X 2 + X 3 + X 4 = 28 are there with 

(a) 0 <Xi< 12 ? 

(b) — 10 < jc <20? 

(c) 0 < Xj, x\ < 6 ,X 2 < 10, X 3 < 15, X 4 < 21? 

11 . How many ways are there to distribute 26 identical balls into six distinct boxes 
with at most six balls in any of the first three boxes? 

12 . How many 5-digit numbers (including leading Os) are there with no digit appear- 
ing exactly two times? 

13 . How many ways are there for a child to take 10 pieces of candy with four types 
of candy if the child does not take exactly two pieces of any type of candy? 

14 . Santa Claus has five toy airplanes of each of n plane models. How many ways 
are there to put one airplane in each of r(r > n) identical stockings such that all 
models of planes are used? 

15 . A wizard has five friends. During a long wizards’ conference, it met any given 
friend at dinner 10 times, any given pair of friends five times, any given threesome 
of friends three times, any given foursome two times, and all five friends together 
once. If in addition it ate alone six times, determine how many days the wizards’ 
conference lasted. 

16 . How many secret codes can be made by assigning each letter of the alphabet a 
(unique) different letter? Give an approximate answer using Euler’s constant e. 

17 . How many ways are there to distribute 1 0 books to 1 0 children (one to a child) and 
then collect the books and redistribute them with each child getting a new one? 

18 . How many arrangements of 1, 2, .... n are there in which only the odd integers 
must be deranged (even integers may be in their own positions)? 

19 . How many ways are there to assign each of five professors in a math department 
to two courses in the fall semester (i.e., 10 different math courses in all) and then 
assign each professor two courses in the spring semester such that no professor 
teaches the same two courses both semesters? 

20 . Consider the following game with a pile of n cards numbered 1 through n. 
Successively pick a different (random) number between 1 and n and remove all 
cards in the pile down to, and including, the card with this number until the pile is 
empty. If the chosen card number has already been removed, pick another number. 
What is the approximate probability that at some stage the number chosen is the 
card at the top of the pile? {Hint: Approach as an arrangement with repetition.) 
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21. There are 15 students, three (distinct) students each from five different high 
schools. There are five admissions officers, one from each of five colleges. Each 
of the officers successively picks three of the students to go to their college. 
How many ways are there to do this so that no officer picks three students from 
the same high school? 

22. The rooms in the circular house plan shown below are to be painted using eight 
colors such that rooms with a common doorway must be different colors. In 
how many ways can this be done? 



23. How many ways are there to color the vertices with n colors in the following 
graphs such that adjacent vertices get different colors? 


(a) 







24. The four walls and ceiling of a room are to be painted with five colors available. 
How many ways can this be done if bordering sides of the room must have 
different colors? 

25. (a) How many arrangements of the integers 1 through n are there in which i is 

never immediately followed by i + 1, for i = 1,2, . . . ,n — 1? 

(b) Show that your answer equals D n + D„_\ . 

26. Repeat Exercise 25(a) now considering n to be followed by 1. 

27. If two identical dice are rolled n successive times, how many sequences of 
outcomes contain all doubles (a pair of Is, of 2s, etc.)? 

28. How many ways are there to seat n couples around a circular table such that no 
couple sits next to each other? 

29. If there are n families with five members each, how many ways are there to seat 
all 5n people at a circular table so that each person sits beside another member 
of his/her family? (Hint: This is not a standard inclusion-exclusion problem but 
uses a type of inclusion-exclusion argument.) 

30. How many arrangements of a, a, a, b, b, b, c, c, c have no adjacent letters the 
same? (Hint: This is tricky — not a normal inclusion-exclusion problem.) 

31. Suppose that a person with seven friends invites a different subset of three 
friends to dinner every night for one week (seven days). How many ways can 
this be done so that all friends are included at least once? 
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32. There are six tennis players and each week for a month (four weeks), a different 
pair of the six play a tennis match. How many ways are there to form the 
sequence of 4 matches so that every player plays at least once? 

33. There are 10 different people. Each person orders three donuts, chosen from five 
types of donuts. How many ways are there to do this such that (i) at least one 
person chooses all three donuts of the first type, (ii) at least one person chooses 
all three donuts of the second type, . . ., (v) at least one person chooses all three 
donuts of the fifth type? 

(a) Two (or more) people may choose the same collection of three donuts. 

(b) Each person must choose a different collection of three donuts. 

34. There are eight Broadway musicals and they offer a special three-night package 
(Friday, Saturday, Sunday nights) where one can order one ticket that is good 
for three different musicals on successive nights (a sequence of three different 
musicals). A travel agent plans to order 30 of these tickets for a tour group of 30 
people. How many ways are there to order a subset of 30 such tickets with the 
constraint that each of the eight musicals appears on at least one ticket? 

35. A company produces eight different designs for sweaters. Each sweater is made 
from three different pieces of cloth (top piece, middle piece, bottom piece). 
There are six different colors available for each piece, and the three pieces in a 
sweater must each be a different color. How many collections (subsets) of eight 
designs are possible if each color appears in at least one of the designs? 

36. Find the number of ways to give each of six different people seated in a circle 
one of m different types of entrees if adjacent people must get different entrees. 

37. How many ways are there to distribute r distinct objects into n indistinguishable 
boxes with no box empty? 

38. (a) Show that D n satisfies the recurrence D n = (n — l)(Z>„_i + D„_ 2 ). 

(b) Rewriting the recurrence in part (a) as D n — nD n -\ = — [D„_i — (n — 1) 
D n _ 2 ], iterate backwards to obtain the recurrence D n — nD„_\ + (—1)". 

(c) Use part (b) to make a list of D n values up to n =10. 

(d) Use part (a) to show that D( x) — e~ x (l — x)~ 1 is the exponential generating 
function for D n . 

39. How many ways are there to distribute r distinct objects into n distinct boxes 
with exactly three empty boxes? With at least three empty boxes? 

40. How many ways are there to deal a six-card hand with at most one void in a suit? 

41. How many ways are there to arrange the letters in INTELLIGENT with at least 
two consecutive pairs of identical letters? 

42. If n balls labeled 1, 2, . . . , n are successively removed from an urn, a rencontre 
is said to occur if the ith ball removed is numbered i. If the n balls are removed 
in random order, what is the probability that exactly k rencontres occur? Show 
that this probability is about e~ l /k\. 
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43. Show that the number of ways to place r different balls in n different cells with 
m cells having exactly k balls is 

(~1 fn\r\ ■ ( n-jY-i k 

ml (j - m)\(n - j)\(r - jk)\(k\)i 

44. (a) If g(x ) is the (ordinary) generating function for N m (see Theorem 2), show 

that g(x ) = Y^k=o $k(x ~ 1)*. This g(x) is called the hit polynomial. 

(b) Show that 2[g(l) + g(— 1)] is the number of elements in an even number of 

A,-s. 

(c) U se part (b) to determine the number of n-digit ternary sequences with an even 
number of Os. Simplify your answer with a binomial expansion summation. 


45. Show that 


Sin — ^ 


k—m 


N,„ 


46. Use a combinatorial argument (with inclusion-exclusion) to prove the following : 


m\ ( n — k 


(a) £(-l)* 

k = 0 
m 

(b) £(“!)*' 

k=0 

(c) E(-1)* 

k—m 

(d) E(-l)*' 

k = 0 


47. Use Theorem 2 to show that 


n — m 
n — r 


m < r < n 


n\ / n — k 
k ) \m — k 
n 
k 

n\ ( n — k + r — 1 


= 0 

n — 1 
m — 1 


r- 1 
71—1 


= D-d 


k—m 


k—m 


m ) \ k 


'in—k 


48. Prove formula (9) in Theorem 2. 

49. In Example 4, let the random variable X — number of people who get their own 


hat. Find E(X). 



8.3 RESTRICTED POSITIONS AND 
ROOK POLYNOMIALS 

In this section we consider the special problem of counting arrangements of n objects 
when particular objects can appear only in certain positions. We will solve one such 
problem involving five objects in careful detail. In the process we will indicate how 
to generalize our analysis to any restricted-positions problem. 
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Figure 8. 7 


Positions 



Consider the problem of finding all arrangements of a, b, c, d, e with the restric- 
tions indicated in Figure 8.7. That is, a may not be put in position 1 or 5; b may not be 
put in 2 or 3; c not in 3 or 4; and e not in 5. There is no restriction on d. A permissible 
arrangement can be represented by picking five unmarked squares in Figure 8.7, with 
one square in each row and each column. For example, the permissible arrangement 
badec corresponds to picking squares (a, 2), ( b , 1), (c, 5), (d, 3), (e, 4). 

When viewed in terms of the 5x5 array of squares, the arrangement problem can 
be thought of as a matching problem, matching letters with positions. In Section 4.4 
we also considered matching problems. There we wanted to determine whether any 
complete matching existed. Here we want to count how many complete matchings 
there are. 

We use the inclusion-exclusion formula, expression (1) of the previous section, 
to count the number of permissible arrangements for Figure 8.7. Let Tl be the set 
of all arrangements of the five letters without restrictions. So N = 5!. Let A, be the 
set of arrangements with a forbidden letter in position i (note that we could equally 
well define the properties in terms of the ;th letter being in a forbidden position). The 
number of permissible arrangements will then be N( A i A 2 A 3 A 4 A 5 ). In terms of Figure 
8.7, A, is the set of all collections of five squares, each in a different row and column 
such that the square in column i is a darkened square. We obtain /V < A , ) by counting the 
ways to put a forbidden letter in position i times the 4! ways to arrange the remaining 
four letters in the other four positions (we do not worry about forbidden positions 
for these letters). Then N(A\) =1x4!, A7 (A 2 ) =1x4!, N(A^) = 2x4!, N(A 4 ) = 
1 x 4!, and N(A$) = 2x4!. Collecting terms, we obtain 

5 

Si = ^A(A,)= 1 x 4! +1x4! + 2x4! +1x4! + 2x4! 

1 = 1 

= (1 + 1 + 2+1 + 2)4! =7x4! 

Observe that (1 + 1+ 2+1+ 2) = 7 is just the number of the darkened squares in 
Figure 8.7. Since each choice of a darkened square (i.e., some letter in some forbidden 
position) leads to 4! possibilities, then 

Si = (number of darkened squares) x 4! 

for any restricted-positions problem with a 5 x 5 family of darkened squares similar 
to Figure 8.7. 
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Next, N (A, A / ) will be the number of ways to put (different) forbidden letters 
in positions i and j times the 3! ways to arrange the remaining three letters. Or 
equivalently, the ways to pick two darkened squares, one in column i and one in 
column j (and in different rows), times 3!. The reader can check that 

N(A\A{) =1x3! N(A\A 3 ) = 2x3! jV(Aj A 4 ) = 1 x 3! 

N(A l A 5 )= 1 x3! A(A 2 A 3 ) = 1 x3! (V(A 2 A 4 ) = 1 x 3! 

(V(A 2 A 5 ) = 2x3! (V(A 3 A 4 ) =1 x 3! (V(A 3 A s ) = 4 x 3! 

(V(A 4 A 5 ) = 2x3! 

Collecting terms, we obtain 

S 2 = J2N(AiAj) = (1 + 2+ 1 + 1 + 1 + 1 + 2+1+4 + 2)31 = 16x3! 

ij 

The number 16 counts the ways to select two darkened squares, each in a different 
row and column. Generalizing, we will have 

( number of ways to pick k darkened squares 
each in a different row and column 

Since letter d’s row in Figure 8.7 has no darkened squares, there is no way to 
pick five darkened squares, each in a different row and column. Thus S$ = 0. On the 
other hand, tedious case-by-case counting apparently awaits us for S 3 and S 4 . Instead, 
let us try to develop a theory for determining the number of ways to pick k darkened 
squares, each in a different row and column. 

This darkened squares selection problem can be restated in terms of a recreational 
mathematics question about a chess-like game. A chess piece called a rook can 
capture any opponent’s piece on the chessboard in the same row or column as the 
rook (provided there are no intervening pieces). Instead of using a normal 8x8 
chessboard, we “play chess” on the “board” consisting solely of the darkened squares 
in Figure 8.7. 

Counting the number of ways to place k mutually noncapturing rooks on this board 
of darkened squares is equivalent to our original subproblem of counting the number 
of ways to pick k darkened squares in Figure 8.7, each in a different row and column. 
The phrase “k mutually noncapturing rooks” is simpler to say and more pictorial. 

A common technique in combinatorial analysis is to break a big messy problem 
into smaller manageable subproblems. We will develop two breaking-up operations 
to help us count noncapturing rooks on a given board B. 

The first operation applies to a board B that can be decomposed into disjoint sub- 
boards B\ and !h , — that is, subboards involving different sets of rows and columns. 
Often a board has to be properly rearranged before the disjoint nature of the two 
subboards can be seen. 

When the rows and columns of Figure 8.7 are rearranged as shown in Figure 8.8, 
it is obvious that the three darkened squares in rows a and e and columns 1 and 5 are 
disjoint from the four darkened squares in rows b and c and columns 2, 3, and 4. Let 
B be the board of darkened squares in Figure 8.8, let B\ be the three darkened squares 
in rows a and <?, and let Ih be the four darkened squares in rows b and c. 


^ x (5 — k)\ (1) 
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Figure 8.8 

Define r k (B) to be the number of ways to place k noncapturing rooks on board 
B , r k ( B\ ) the number of ways to place k noncapturing rooks on subboard B\ , and 
r k (B 2 ) the number of ways to place k noncapturing rooks on subboard B 2 . There 
are three ways to place one rook on subboard B\ in Figure 8.8, since B\ has three 
squares, and similarly four ways to place one rook on subboard lh- Then r\(B\) = 3 
and f| ( lh ) = 4. A little thought shows that there is only one way to place two rooks 
on subboard B\ and three ways to place two rooks on subboard B 2 . so that r 2 (B\) = 1 
and r 2 (B 2 ) = 3. Note that r k (B\) = r k (B 2 ) = 0 for k > 3, since each subboard has 
only two rows. It will be convenient to define r () = 1 for all boards. 

Observe next that since B\ and B 2 are disjoint, placing, say, two noncapturing 
rooks on the whole board B can be broken into three cases: placing two noncapturing 
rooks on B\ (and none on B 2 ), placing one rook on each subboard, or placing two 
noncapturing rooks on B 2 . Thus we see that 

r 2 (B) = r 2 (B\) + r i(B\ )r i (B 2 ) + r 2 (B 2 ) 
or, using that fact that ro(B 2 ) = r () ( B t ) = 1, 

r i(B) = r 2 (Bi)ro(B 2 ) + r l (Bi)r 1 (B 2 ) + ro(Bi)r 2 (B 2 ) 

= 1x1+3x4+1x3=16 (2) 

Recall that 16 is the number obtained earlier when summing all N(A,Aj ) to count all 
ways to pick two darkened squares each in a different row and column. 

The reasoning leading to (2) applies to ri<(B) for any k and for any board B that 
decomposes into two disjoint subboards B\ and B 2 . 

Lemma 

If B is a board of darkened squares that decomposes into the two disjoint subboards 
B\ and B 2 , then 

n(B) = r k (B 1 )r 0 (B 2 ) + r k ^ 1 (B 1 )r 1 (B 2 )-\ F r 0 (B 1 )r k (B 2 ) (3) 

The observant reader may notice that (3) is very similar to formula (6) in Sec- 
tion 6.2 for the coefficient of a product of two generating functions. That is, if 
f(x) = a r x r and g(x) — J^b r x r , then the coefficient of x k in h{x) = f(x)g(x) 
is a k bo + a k _\b\ + • — f- aob k . We will now exploit this similarity. 

We define the rook polynomial R(x, B) of the board B of darkened squares to be 

R(x , B) — r 0 (B) + r\(B)x + r 2 (B)x 2 -\ 
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Remember that r n ( B) = 1 for all B. Note that the rook polynomial depends only on 
the darkened squares, not on the size of the original assignment diagram. Then for 
B\ and /+ as defined above, we found that 

R(x, B\) = 1 + 3x + lx 2 and R(x, B 2 ) = 1 + 4x + 3x 2 

Moreover, by the correspondence between (3) and the formula for the product of 
two generating functions, we see that a(B), the coefficient of x k in the rook poly- 
nomial R(x, B) of the full board, is simply the coefficient of x k in the product 
R(x, B\)R(x, Z?2 ) - That is, 

R(x, B ) = R(x, B\)R{x, B 2 ) = (l+3x+ lx 2 )(l+4x + 3x 2 ) 

= 1 + 1(3 x 1) + (1 x 4)]x + [(1 x 1) + (3 x 4) + (1 x 3)]x 2 
+ [(1 x 4) + (3 x 3)]x 3 + (1 x 3)x 4 
= 1 + 7x + 16x 2 + 13x 3 + 3x 4 

This product relation is true for any such B , B\, and IE. 

Theorem 1 

If B is a board of darkened squares that decomposes into the two disjoint subboards 
B\ and B 2 then 


R(x, B)=R(x, Bi)R(x, B 2 ) 

Without meaning to belittle the role of generating functions in Theorem 1 , we 
should observe that the generating functions were used here solely because polynomial 
multiplication corresponds to the subboard composition rule given in (3). That is, 
plugging the r^B,) s into two polynomials and multiplying them together is a more 
familiar way to organize the computation required by (3). Shortly we will decompose 
a board B into nondisjoint subboards, and rook polynomials will then play a truly 
essential role. 

Now that R(x, B) has been determined for the board of darkened squares in 
Figure 8.8 and hence in Figure 8.7, we can solve our original problem about the 
permissible arrangements of a , b, c, d, e. Expression (1) for can now be rewritten 

S k = r k (B)(5-k)\ 

By the inclusion-exclusion formula, the number of permissible arrangements is 

N (A1T2A3A4A5) = N — Si + S 2 — S 2 + S 4 . — S 5 

= 5! - ri(,B)4! + r 2 (B) 3! - r 3 (B) 2! + r 4 (B) 1! - r s (B) 0! 

= 5!-7x4! + 16x3! — 13x2! + 3xl!-0x0! 

= 120- 168 + 96-26 + 3-0 = 25 

The values for the r^(B) s came from the rook polynomial Rix. B) for Figure 8.8 
computed above. This rook-polynomial-based variation on the inclusion-exclusion 
formula is valid for any arrangement problem with restricted positions. 
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Theorem 2 

The number of ways to arrange n distinct objects when there are restricted positions 

is equal to 

n\ — r\{B){n - 1 )\ + r 2 (B)(n - 2 )! H f(-l fr k {B){n - k)\ 

+ ■•• +(-l)'V„(B) 0 ! ( 4 ) 

where the r k (B ) s are the coefficients of the rook polynomial R(x, B) for the board B 

of forbidden positions. 

Let us summarize the little theory we have developed. 

1 . Given a problem of counting arrangements or matchings with restricted positions, 
display the constraints in an array with darkened squares for forbidden positions, 
as in Figure 8 . 7 . 

2 . Try to rearrange the array so that the board B of darkened squares can be decom- 
posed into disjoint subboards B\ and lh. 

3 . By inspection, determine the r k (Bj) s, the number of ways to place k noncapturing 
rooks on subboard B, . 

4. Use ther^(B,)sto form the rook polynomials R(x, B\ ) and R(x , Bj), and multiply 
R(x, Bi)R(x, B2) to obtain R(x, B). 

5 . Insert the coefficient r k (B) of R(x, B) in formula ( 4 ). 


Example 1: Sending Birthday Cards 

How many ways are there to send six different birthday cards, denoted C 1, C2, C3, 
C 4, C5, C(,, to three aunts and three uncles, denoted A 1, A2, A3, U\, U2, U3, if aunt A 1 
would not like cards C2 and C4; if A2 would not like C\ or C5; if A3 likes all cards; 
if U\ would not like C\ or C5; if U2 would not like C4; and if t/3 would not like Cj, ? 

The forbidden positions information is displayed in Figure 8 . 9 . We rearrange 
the board by putting together rows with a darkened square in the same column, and 
putting together columns with a darkened square in the same row. For example, rows 
Ci and C5 both have darkened squares in columns A2 and U\, so we put rows Ci 
and C5 beside one another and columns A2 and U\ beside one another; similarly for 



Figure 8.9 
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Figure 8.10 



rows C2 and C 4 and columns A\ and U2 . We get the rearrangement shown in Figure 
8.10. Thus the original board B of darkened squares decomposes into the two disjoint 
subboards, B\ in rows C 1 and C5, and IS 2 in rows C2, C4, and C(,. Actually IS 2 itself 
decomposes into two disjoint subboards IS'-, and IS" , where IS" is the single square 
(C6, f/3). By inspection, we see that 

R(x , B\) = 1 + 4x + 2x 2 

R(x , B 2 ) = R(x, B' 2 )R(x, b;') = (1 + 3x + x 2 )(l + x) 

So 


R(x, B) = R(x, Bi)R(x, B 2 ) 

= (l+4x + 2x 2 )(l + 3x + x 2 )(l+x) 

= 1 + 8x + 22x 2 + 25x 3 + 12x 4 + 2x 5 

Then the answer to the card-mailing problem is 

6 

J2(~V k rk(B)(6-k)\ 

k= 0 

= 6!- 8 x 51 + 22 x41-25 x 3! + 12x2! -2 x 1!+ 0x0! 

= 720-960 + 528- 150 + 24-2 + 0= 160 ■ 

Now let us consider the problem of determining the coefficients of R(x, B) when 
the board B does not decompose into two disjoint subboards. Consider the board IS 
shown in Figure 8.11. Let us break the problem of determining r k ( IS ) into two cases, 
depending on whether or not a certain square s is one of the squares chosen for the 
k noncapturing rooks. Let B s be the board obtained from IS by deleting square s, and 
let B* be the board obtained from B by deleting square s plus all squares in the same 
row or column as s. If s is the square indicated in Figure 8.11, then B s and B* are as 


a 


Figure 8.11 
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shown in Figure 8. 12. If square .y is not used, we must place k noncapturing rooks on 
B s . If square s is used, then we must place k — I noncapturing rooks on 11* . Hence 
we conclude that 

r k {B) = r k {B s ) + r k _ l (B*) (5) 

Using the generating function methods introduced in Section 7.5 for turning a 
recurrence relation into a generating function, we obtain from (5) 

R(x, B) = J2 r k(B)x k = J2 r k (B s )x k + r*_i(5,V 

k k k 

= ^ r k (B s )x k +x^2 r h{B*)x h 

k h 

= R{ x, B s ) + xR(x, B*) 

Multiplying the rook polynomial R( x, 11 * ) by x reflects the fact that one rook is 
placed in square s in combination to any placement of rooks on board II* . Observe 
that B s and II* both break into disjoint subboards whose rook polynomials are easily 
determined by inspection: 

R( x, B s ) = (1 + 3 jc)(1 + 2x) = 1 + 5x + 6.x 2 
R(x, B*) = (l + 2x)(l+x)= 1 +3x + 2x 2 
R(x, B) = R(x, B s ) + xR(x, B*) = (1 + 5x + 6x 2 ) + x(l + 3.x + 2x 2 ) 

= 1 6x 9x~ 2x 2 

These results apply to any board B and any square .v in 11. 

Theorem 3 

Let B be any board of darkened squares. Let s be one of the squares of 11, and let B s 
and 11* be as defined above. Then 

R(x, B) = R(x, B s ) + xR(B*) 

Theorem 3 provides a way to simplify any board’s rook polynomial. If the boards 
B s and IT* do not break up into disjoint subboards, we can reapply Theorem 3 to II, 
and B*. It is important that the square s be chosen to split up II as much as possible. 

Example 2: Nondecomposable Constraints 

In Example 1, suppose that the tastes of uncle U\ change and now he would not 
like card C 2 but would like Cj . The new board of forbidden positions is shown in 
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a 2 u i Aj u 2 u 3 



Figure 8.13. The square in the bottom right comer, call it t, is still disjoint from 
the other squares, call them board B\. Board B\ cannot be decomposed into disjoint 
subboards, and so we must use Theorem 3. The square 5 that breaks up B\ most evenly 
is (C 2 , U\), indicated in Figure 8.13. 

Boards B s and B* shown in Figure 8.14 both decompose into simple disjoint 
subboards: 

R(x, B s ) = (1 + 3x + x 2 )(l + 3x + x 2 ) = 1 + 6x + llx 2 + 6x 3 +x 4 
R(x, B*) = (1 + 2x){\ + 2x) = 1 + 4x + 4x 2 

Then 

R{x, Bi) = R(x, B s ) + xR(x, B *) = (1 + 6x + 1 lx 2 + 6x 3 + x 4 ) 

+x(l +4x + 4x 2 ) 

= 1 + 7x + 15x 2 + 10x 3 + x 4 


and 


R(x, B ) = R(x, B\)R(x, t) 

= (1 + lx + 15x 2 + 10x 3 + x 4 )(l +x) 

= 1 + 8x + 22x 2 + 25x 3 + 1 lx 4 + x 5 

Now by Theorem 2, the number of ways to send birthday cards is 

6! -8 x5! + 22x 41-25 x 3!+ 11 x 2! - 1 x l! + 0x 0! = 159 

Note that uncle U 1 ’s change in tastes changed the final rook polynomial only slightly: 
r 4 ( li ) changed from 1 2 to 1 I . r<,(B) changed from 2 to 1 , and the final answer changed 
from 160 to 159. ■ 


S 


S 


Figure 8.14 
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8.3 EXERCISES 


Summary of Exercises The first nine exercises are similar to the 
examples in this section; the next seven exercises develop theory about rook polyno- 
mials and combinatorial theory based on rook polynomials. 


1. Describe the associated chessboard of darkened squares for finding all derange- 
ments of 1, 2, 3, 4, 5. 

2. Find the rook polynomial for the following boards: 




3. Find the number of matchings of five men with five women given the constraints 
in the figure below on the left, where the rows represent the men and the columns 
represent the women. 




4. Find the number of matchings of five men with five women given the constraints 
in the figure above on the right, where the rows represent the men and the columns 
represent the women. 

5. Seven dwarfs £>i , £F, £> 3 , £) 4 , £> 5 , £>g, D 1 each must be assigned to one of seven 
jobs in amine, J\ , Jo, £3, £4, £5, J(,, £7 - Di cannot do jobs J\ or £3; Do cannot do 
£1 or £5; Z) 4 cannot do £3 or £ f) ; £>5 cannot do Jo or £7; £>7 cannot do £4. £>3 and 
£>6 can do all jobs. How many ways are there to assign the dwarfs to different 
jobs? 
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6 . A pair of two distinct dice are rolled six times. Suppose none of the ordered 
pairs of values (1, 5), (2, 6 ), (3, 4), (5, 5), (5, 3), ( 6 , 1), ( 6 , 2) occur. What is the 
probability that all six values on the first die and all six values on the second die 
occur once in the six rolls of the two dice? 

7. A computer dating service wants to match four women each with one of five men. 
If woman 1 is incompatible with men 3 and 5; woman 2 is incompatible with 
men 1 and 2; woman 3 is incompatible with man 4; and woman 4 is incompatible 
with men 2 and 4, how many matches of the four women are there? 

8. Suppose five officials 0\, Oi, Os , O4, Os are to be assigned five different city 
cars: an Escort, a Lexus, a Nissan, a Taurus, and a Volvo. 0\ will not drive an 
Escort or a Nissan; CL will n °t drive a Taurus; O 3 will not drive a Lexus or a 
Volvo; O 4 will not drive a Lexus; and Os will not drive an Escort or a Nissan. If 
a feasible assignment of cars is chosen randomly, what is the probability that 

(a) 0\ gets the Volvo? 

(b) O2 or Os get the Volvo? (Hint: Model this constraint with an altered board.) 

9. Calculate the number of words that can be formed by rearranging the letters 
EERR1E so that no letters appears at one of its original positions — for example, 
no E as the first, second, or sixth letter. 

10 . Find the rook polynomial for a full n x n board. 

11. An ascent in a permutation is a consecutive pair of the form i,i + 1 . The ascents 
in the following permutation are underlined; 12534. 

(a) Design a chessboard to represent a permutation of 1, 2, 3, 4, 5 so that a check 
in entry (i, j) means that i is followed immediately by j in the permutation. 
Darken entries so as to exclude all ascents. 

(b) Find the rook polynomial for the darkened squares in part (a). 

(c) Find the number of permutations of 1, 2, 3, 4, 5 with k ascents. (Hint: See 
Theorem 2 in Section 8.2.) 

12 . Let R n m (x) be the rook polynomial for an n x m chessboard (n rows, m columns, 
all squares may have rooks). 

(a) Show that R ntn (x^) — Rn — T nix R \ /}l ] (.x ) . 

(b) Show that R n m (x) = nmR n _\ m _i(x). [Hint: Use identity (5) in Section 
5.5.] 

13 . Find two different chessboards (not row or column rearrangements of one 
another) that have the same rook polynomial. 

14 . Consider all permutations of 1 , 2, . . . , n in which i appears in neither position 
i nor i + 1 (n not in n or 1). Such a permutation is called a menage. Let M„(x) 
be the rook polynomial for the forbidden squares in a menage. Let M*(x) be the 
rook polynomial when n may appear in position 1, and let M°(x) be the rook 
polynomial when both 1 and n may appear in position 1 . 

(a) Show that M*(x) = xM*(x) + M®(x), M„(x) = xM®_ l (x) + M*(x), and 
M n (x) = M*(x) + xM*_ l (x). 
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(b) Using the initial conditions Mj*(x) = 1 +x, Mj(jc) = 1, show by induction 
that 


KM = E 


k=0 


2n — k 


and M°(x) = 


k=0 


2n-k-\ 


(c) Find M„(x). 


15. Given an n x m chessboard C n m (see Exercise 12) and a board C of darkened 
squares in C„, m , the complement C' of C in C njn is the board of nondarkened 
squares. 


(a) Show that 


r*(c') = £(- iy 

j = o 


n~J 

k~j 


(b) Show that R(x, C') = x n R(l/x, C). 


f m - j 

\k-j 


C k~j)\r k (C ) 


16. Use Theorem 2 in Section 8.2 to derive a formula for counting arrangements 
when exactly k elements appear in forbidden positions. 



y 8.4 SUMMARY AND REFERENCE 

Frequently the combinatorial complexities in the counting problems in Chapter 5 
arose from simultaneous constraints such as “with at least one card in each suit.” In 
this chapter, we formed a simple set-theoretic model for such problems and solved 
once and for all the combinatorial logic of this model. The resulting formula, the 
inclusion-exclusion formula, was then applied to various counting problems. This 
formula requires the proper set-theoretic restatement of a problem and the solution of 
some fairly straightforward subproblems, but in return the formula eliminates all the 
worry about logical decomposition (as well as worry about counting some outcomes 
twice). After having no help in their problem-solving in Chapter 5, readers should 
find it easy to appreciate fully the power of a formula that does much of the reasoning 
for them. This is what mathematics is all about! 

The last section on rook polynomials provides a nice mini-theory about organiz- 
ing the inclusion-exclusion computations in arrangements with restricted positions. 
The inclusion-exclusion formula was obtained by J. Sylvester about 100 years ago, 
although in the early eighteenth century the number of derangements had been cal- 
culated by Montmort and a (noncounting) set union and intersection version of the 
formula was published by De Moivre. Rook polynomials were not invented until the 
mid-twentieth century (see Riordan [1]). 

1. J. Riordan, An Introduction to Combinatorial Analysis, John Wiley & Sons, 
New York, 1958. 
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CHAPTER 9 

POLYA’S ENUMERATION 
FORMULA 



9.1 EQUIVALENCE AND SYMMETRY GROUPS 


In this chapter we examine a special class of counting problems. Consider the ways 
of coloring the corners of a square black, white, or red: There are three color choices 
at each of the four corners, giving 3x3x3x3 = 81 different colorings. Suppose, 
however, that the square is not in a fixed position but is unoriented, like a square 
molecule in a liquid. Now how many different corner colorings are there? The unori- 
ented figure being colored could be a n-gon or a cube, and the edges or faces could be 
colored instead of the corners. The floating square problem is equivalent to finding 
the number of different (unoriented) necklaces of four beads colored black, white, or 
red. Polya’s motivation in developing his formula for counting distinct colorings of 
unoriented figures came from a problem in chemistry, the enumeration of isomers. 

The difficulty in these problems comes from the geometric symmetries of the 
figure being colored. We develop a special formula, based on this set of symmetries, 
to count all distinct colorings of a figure. With a little more work, we also obtain 
a generating function that gives a pattern inventory of the distinct colorings. For 
example, the pattern inventory of black-white colorings of the corners of a cube with 
all geometric symmetries allowed is 

b s + b 1 w + 3 b 6 w 2 + 3 b 5 w 3 + 7b 4 w 4 + 3 b 3 w 5 + 3 b 2 w 6 + bw 1 + w 8 

where the coefficient of b 1 vv' is the number of nonequivalent colorings with i black 
corners and j white comers. 

The presentation in this chapter avoids the notational complexities and abstract 
framework of traditional presentations of Polya’s Enumeration Formula. Instead, we 
develop the theory for Polya’s Formula by means of a simple concrete example — 
namely, black-white colorings of the corners of an unoriented ( floating) square. Once 
one understands the needed concepts and analysis for 2-coloring an unoriented square, 
it will be easy to state the general formula and apply it to other geometric structures. 
While students may be used to seeing a general formula developed and then applied 
in subsequently examples, many important mathematical results were first obtained 
by working special cases and then generalizing. 

It is impossible to draw an unoriented object; any picture shows it in a fixed 
position. Thus, we start our analysis with the 2 4 = 16 black-white colorings of the 
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fixed square. See Figure 9.1. We can partition these 16 colorings into subsets of col- 
orings that are equivalent when the square is floating. There are six such subsets (see 
the groupings of colored squares shown in Figure 9.1), and so there are six different 
2-colorings of the floating square. The set of fixed colorings would be too large if 
a harder sample problem were used, such as 2-colorings of a cube or 3-colorings 
of the square. 

We seek a theory and formula to explain why there are six such distinct 2-colorings 
of the square. Note that the six subsets of equivalent colorings vary in size. 

To define the partition of a set into subsets of equivalent elements, we first 
define the general concept of the equivalence of two elements a and b. We write this 
equivalence as a ~ b. The fundamental properties of an equivalence relation are 


(i) Transitivity: 

(ii) Reflexivity: a ~ a 

(iii) Symmetry: a ~ b => b ~ a 


All other properties of equivalence can be derived from these three . Any binary relation 
with these three properties is called an equivalence relation. Such a relation defines 
a partition into subsets of mutually equivalent elements called equivalence classes. 


Example 1: Equivalence Relations 

(a) For a set of people, being the same weight is an equivalence relation; all people 
of a given weight form an equivalence class. 

(b) For a set of numbers, differing by an even number is an equivalence relation; the 
even numbers form one equivalence class and the odd numbers the other class. 

(c) For a set of figures, having the same number of corners is an equivalence relation; 
for each n, an equivalence class consists of all n -corner figures. ■ 


Next we turn our attention to the motions that map the square onto itself (see 
Figure 9.2). These motions, or symmetries, are what make the foregoing colorings 
equivalent to another one. Before we develop a theory about symmetries and their 
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Figure 9.2 


relation to coloring equivalence, let us take a closer geometric look at the symmetries 
of a square and of some other figures. 


Example 2: Symmetries of Even n-gons 

Figure 9.2 displays the set of motions that map the square onto itself, the symmetries 
of the square. How was this set obtained? More generally, what is the set of symmetries 
of an n- gon, for n even? 

The symmetries of the square divide into two classes: rotations — circular motions 
in the plane; and reflections (flips) — motions using the third dimension. The rotations, 
about the center of the square, are easy to find. Each rotation is an integral multiple 
of the smallest (nonzero) rotation. The rotations are 7t2 = 90° rotation, n 3 = 180° 
rotation, 714 = 270° rotation, and tt\ = 360° (or 0°) rotation. 

Reflections are a little harder to visualize since they are motions in three dimen- 
sions. The reflections are n 5 = reflection about the vertical axis, tt () = reflection about 
the horizontal axis, ttj = reflection about opposite corners a and c, and tt^ = reflection 
about opposite corners b and d. 

In a regular n-gon, the smallest rotation is (360 /n)° . Any multiple of this (360 jn f 
rotation is again a rotation, and so there are n rotations in all. There are two types 
of reflections for a regular even n-gon: flipping about the middles of two opposite 
sides and flipping about two opposite corners. Since there are n/2 pairs of opposite 
sides and n /2 pairs of opposite corners, a regular even n- gon will have n/2 + n /2 = n 
reflections. Summing rotations and reflections, we find that a regular even n-gon has 
2 n symmetries. 

We leave it as an exercise to show that these 2n symmetries just described are 
distinct. To show that there are at most 2n symmetries of an even n-gon, consider a 
particular corner, call it x, and the edge e incident to x on the clockwise side. The 
position of x and the relative position of e after the action of a symmetry totally 
determine the symmetry (readers should convince themselves of this). A symmetry 
could map x to any of the n comers of the n-gon — n choices — and e could be mapped 
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Figure 9.3 Symmetric reflection of a pentagon 

to either side of x — 2 choices. In total, there are 2n possible different symmetries of 
the even «-gon. ■ 

Example 3: Symmetries of Odd n-gons 

Describe the symmetries of a pentagon, and more generally, of an 77-gon for odd n. 

As noted in Example 2, any regular n-gon has n rotational symmetries. A pentagon 
will have five rotational symmetries of 0°, 12°, 144°, 216°, and 288°. However, the 
reflections discussed in Example 2 about opposite sides or opposite corners do not 
exist in the pentagon. Instead, we reflect about an axis of symmetry running from 
one corner to the middle of an opposite side (see Figure 9.3). There are five such 
reflections, for a total of 10 symmetries. 

In a regular odd n-gon, there are n such flips, along with n rotations. Summing 
rotations and reflections, we find that a regular odd n-gon also has 2n symmetries. 
We leave it as an exercise for the reader to show that these 2n symmetries are all 
distinct. The same argument used in Example 2 shows that there are only these 2n 
symmetries. ■ 

Example 4: Symmetries of a Tetrahedron 

Describe the symmetries of a tetrahedron. 

A tetrahedron consists of four equilateral triangles that meet at six edges and 
four corners (see Figure 9.4). Besides the motion leaving all corners fixed — call it 
the 0° motion — we can revolve 120° or 240° about a corner and the center of the 
opposite face (see Figure 9.4 a), or we can revolve 180° about the middle of opposite 
edges (see Figure 9 Ah). Since there are four pairs of a corner and opposite face 
and three pairs of opposite edges, we have a total of 1 (0° motion) + 4x2 + 3= 12 
symmetries. It is left as an exercise to check that these 12 symmetries are distinct 
and that no other symmetries exist. ■ 




Figure 9.4 Symmetric revolutions of 
a tetrahedron 


(a) 


(b) 
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The symmetries of a square are naturally characterized by the way they permute 
the corners of the square. Thus, the 180 ° rotation n 3 (see Figure 9 . 2 ) can be described 
as the corner permutation: a— y c,b — y d,c — y a, d — > in tabular form, we write 
(c da t)' The 90 ° rotation it2 can be described as a — »■ b. b — »■ c, c — > d . d — »■ a, or 
a — y b — y c — y d — y a. 

A permutation of the form xi —y X2 — > X3 ■ ■ ■ — y x„ —y x\ is called a cyclic per- 
mutation or cycle. Thus, 7t2 is a cycle of length 4 . Cycles are usually written 
in the form (x 1 X2X3 . . .x n x\). So 7T2 = (abed). Any permutation can be expressed 
as a product of disjoint cycles (proof of this claim is an exercise). For example, 
7Z3 = ( ac)(bcl ), 7T4 = ( adeb ), and rtj = ( a)(bd)(c ). The reader should find cycle de- 
compositions for the other tt,- s. The depiction of a motion as in Figure 9 . 2 , with arrows 
indicating the mapping at each corner, will make it easier to trace out cycles in later 
calculations. 

In permuting the corners, the symmetries create permutations of the colorings of 
the corners. For example, if C/ is the ith square in Figure 9 . 1 , then n 3 is the following 
permutation of colorings: 

_ f C\ C2 C3 C4 C5 Ce C7 C% C9 C10 C 11 C12 C13 C14 C15 Ci6 
3 \Ci C4 C5 C2 C3 C% Cg C(, C7 C 10 C 11 Ci4 Ci5 C12 Ci3 C 16 

The point is that while a symmetry 7 r,- is easily visualized by how it moves the corners 
of the square, what we are really interested in is the way 7 r,- takes one coloring into 
another (making them equivalent). Thus, we formally define our coloring equivalence 
as follows: 



Colorings C and C' are equivalent, C ~ C ' , 

if there exists a symmetry n, such that 7 r;(C) = C' ( 2 ) 

The properties of the set G of symmetries that interest us are the ones that make 
the relation C ~ C' in Eq. ( 2 ) an equivalence relation. These properties of G are (here 
71 / ■ 7 T j means applying motion tc ,■ followed by motion Jtj ): 

1 . Closure: If tt,- , 7 r ; e G, then 7 r; ■ Jtj e G ; for example, in Figure 9 . 2 , tt 2 ■ 7 r 5 = ttj. 

2 . Identity: G contains an identity motion 7 t\ such that tT| • tt, = 7 r, and itj ■ tti = tt,- ; 
in Figure 9 . 2 , 7 Ti is tti . 

3 . Inverses: For each 7 r ( - e G, there exists an inverse in G, denoted 7^, such that 
7T ; _1 ■ Tti = JTi and Tti • TT/ = 7Ti; for example, in Figure 9 . 2 , rtf 1 = Tt 4. 

Observe that closure makes our coloring relation ~ satisfy transitivity [property 
(i) of an equivalence relation]. For suppose C ~ C' and C' ~ C". Since C ~ C', there 
must exist tt, e G such that tt,- (C) = C' . Similarly, there i s a tt e G such that Tt j (C') = 
C" . Then by closure, there exists 7 T* = 7 r,- - rtj e G with Ttk(C) = (Tti • ttj)(C ) = C". 
Thus C ~ C" . Similarly, properties ( 2 ) and ( 3 ) of the symmetries imply that our col- 
oring relation satisfies properties (ii) and (iii) of an equivalence relation, respectively 
(see Exercise 16 ). 
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A collection G of mathematical objects with abinary operation is called a group if 
it satisfies properties 1 , 2, and 3 along with the associativity property — in, ■ Jtj) ■ Jtk = 
Ttj ■ (: Xj ■ jtk). Thus we have the following theorem. 

Theorem 

Let G be a group of permutations of the set S (corners of a square) and T be any 
collection of colorings of S (2-colorings of the corners). Then G induces a parti- 
tion of T into equivalence classes with the relation C ~ C' o some it e G takes 
C to C. 

Note that S could be any set of objects and T could be any possible collection of 
colorings. The following simple lemma about groups lies at the heart of the counting 
formula developed in the next section. 

Lemma 

For any two permutations n, , jtj in a group G, there exists a unique permutation 
jt k = jrr * 1 • 7 : j in G such that jr,- • jt k = ttj. 

Proof 

First we show that jtt ■ tt k = ttj. Since n k = jtf 1 • Ttj, 

jtj ■ jtk = Tti ■ (rtf 1 - Ttj) = (jti -nf 1 ) ■ Jtj (by associativity) 

— T X\ • Ttj — Ttj 

as claimed. Next we show that n k is unique. Suppose there also exists a permutation 
n' k such that jtj ■ 7t k = Ttj. Then jtj ■ Tt k = Tti ■ Jt' k . Multiplying the equation by rtf , we 
have 


Ttj ' (jCi • 7t k ) — Ttj • (jt i • Tt k ) => (jtj • Tt[) ' Tt k — (j tj • Ttf) • 7 t k 

— 7 ^ Tt\ " Ttk ~~ ^1 * 7tfc k ^ 



9.1 EXERCISES 


Summary of Exercises The first 1 3 exercises continue the exam- 
ples of equivalence and symmetries given in this section. The remaining exercises 
develop basic aspects of group theory associated with symmetries. Prior experience 
with modern algebra is needed for most of these latter problems. 


1. Which of the following relations are equivalence relations? State your reasons. 

(a) < (less than or equal to), for a set of numbers 

(b) = (equal to), for a set of numbers 

(c) “Difference is odd,” for a set of numbers 

(d) “Being blood relations,” for a group of people 

(e) “Having a common friend,” for a group of people 
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2. Which of the following collections with given operations are groups? For those 
collections that are groups, which elements are the identities? 

(a) The nonnegative integers 0, 1,2,... with addition 

(b) The integers 0, 1, 2, . . ., n — 1 with addition modulo n 

(c) All polynomials (with integer coefficients) with polynomial addition 

(d) All nonzero fractions with regular multiplication 

(e) All invertible 2x2 real- valued matrices with matrix multiplication 

3. Find all symmetries of the following figures (indicate with arrows where the 
comers move in each symmetry, as in Figure 9.2): 


(a) ( b ) b (c) 



4. Write the following symmetries or permutations as a product of cyclic permuta- 
tions: 

(a) 7T 2 (b) 7T 3 (c) TT 6 (d) 7TI 

(e) 7T 4 • 7T 7 (f) 7T 7 • 7T 4 (g) ( 3546712 ) 

5. For the symmetries of the square listed in Figure 9.2, give the associated permu- 
tation of 2-colorings, as in (1), for 

(a) tti (b) 7 r 2 (c) 7t 5 (d) n 1 

6. (a) List all 2-colorings of the three corners of a triangle. 

(b) For the following symmetries of a triangle, give the associated permutation 
of 2-colorings, as in (1), for 

(i) 7t = 120° rotation (ii) it = flip about vertical axis 

7. Show that the eight symmetries of a square listed in Figure 9.2 are all distinct. 

8. Show that the 2n symmetries of an n-gon (even or odd) mentioned in Examples 
2 and 3 are all distinct. 

9. Show that the 12 symmetries of a tetrahedron listed in Example 4 are all distinct. 
Show that there are exactly 12 symmetries of a tetrahedron. 

10. Find the symmetry of the square equal to the following products (remember that 
Tti ■ Ttj means applying motion tt, followed by motion itj): 

(a) 712-^4 (b) 7T2 • 7T 5 (c) 7r 7 • (7T 2 • 7T 8 ) (d) (7T 7 • 7T 6 ) • 7T 3 

11. (a) Write out the 6x6 multiplication table for the product of all pairs of sym- 

metries of a triangle. 

(b) Repeat part (a) for integers 1, 2, 3, 4 with multiplication modulo 5. 
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(c) Repeat part (a) for the following group of permutations of 1, 2, 3, 4: 

/1234\/1234\/ 1 234\/1234\ 

\123 4y\2143y\3412y\4 32iy 

12. Find two symmetries of the square tt; , ttj such that tt,- • tt ; ^ Jtj ■ tt, (this means 
the group of symmetries of a square is noncommutative). 

13. Many organic compounds consist of a basic structure formed by carbon atoms 
(carbon atoms are the corners of a floating figure), plus submolecular groups 
called radicals that are attached to each carbon atom (the carbons are like cor- 
ners and the radicals like colors). Suppose such an organic molecule has six 
carbon atoms, with four radicals of type A and two radicals of type B. Suppose 
this molecule has three isomers — that is, three different ways that the two types 
of radicals can be distributed. Which of the following two hypothetical carbon 
structures would have three isomers with four As and two B s? 



14. Prove that an equivalence relation partitions a set into disjoint subsets of mutually 
equivalent elements. 

15. Give a procedure for decomposing any permutation into a product of cycles. 

16. Show that properties (2) and (3) of a group G of permutations imply properties 
(ii) and (iii), respectively, of the associated equivalence relation defined in the 
theorem. 

17. Let S be a set and G a group of permutations of S. For any two subsets Si , S 2 of 
S, define Si ~ S 2 to mean that for some jr e G, Si = 7 r(S 2 ) (= {tt(s) | s e S 2 }). 
Show that ~ is an equivalence relation. 

18. Prove that the set of permutations of the 2-colorings of a square [see (1)] forms 
a group. 

19. Prove that for any prime p, the integers 1,2,...,/?— 1 with multiplication modulo 
p form a group. 

20. A subset of elements in a group G is said to generate the group if all elements in 
G can be obtained as (repeated) products of elements in the subset. 

(a) Which of the following subsets generate the group of symmetries of a square? 

(i) Tt] , 7 t 2 , it 3 (ii) 7t 2 , 7t 5 (iii) tt 3 , tt 6 (iv)jr 6 , 7 r 7 

(b) Show that the group of symmetries of a regular n- gon can be generated by a 
subset of two elements. 

21. If a subset G' of elements in a group G is itself a group, then G' is called a 
subgroup of G. 


9.2 Burnside’s Theorem 


363 


(a) Show that G' = [jti, Jt2, tt^} and G" = are subgroups of the 

group G of symmetries of a square. 

(b) Find another 4-element subgroup of G containing jt?, . 

(c) Find all subgroups of the group G of symmetries of a square. 

22. (a) How many different binary relations on n elements are possible? 

(b) How many symmetric binary relations are possible? 

23. Show that jt/ ~ : Xj if there exists tt e G such that 7T,- = jt~ l Jtjit is an equivalence 
relation. 

24. A transposition is a cycle of size 2 — that is, a permutation that interchanges the 
positions of just two elements (and leaves all other elements fixed). Show by 
induction that any permutation of n elements can be written as a composition of 
transpositions. 

25. For a given group G of n elements, define the function f n on G as follows: for 
each n ' e G, f n {n') = Jt -n' . 

(a) Show that f n is a one-to-one mapping for any n e G. 

(b) If we define f n *fn 2 — fn v n 2 , show that the set {/„■} forms a group. 



9.2 BURNSIDE’S THEOREM 

We now develop a theory for counting the number of different (nonequivalent) 
2-colorings of the square. More generally, in a set T of colorings of the corners 
(or edges or faces) of some figure, we seek the number N of equivalence classes of T 
induced by a group G of symmetries of this figure. 

Suppose there is a group of s symmetries acting on the c colorings in T. Let Ec 
be the equivalence class consisting of C and all colorings C' equivalent to C — that is, 
all C' such that for some n e G, jt(C) — C' . If each of the .v jts takes C to a different 
coloring Jt(C), then E c would have s colorings. Note that the set of 7t(C)s includes 
C since 7Ti(C) = C (it] is the identity symmetry). If every equivalence class is like 
this with s colorings, then 

sN = c: (number of symmetries) x (number of equivalence classes) 

= (total number of colorings) 

Solving for N, we have N = c/s. 

Consider, for example, the c = n\ oriented seatings of n people around a round 
table. There are s = n cyclic rotations of the seatings, and each equivalence class 
consists of n seatings. Thus, the number of equivalence classes (cyclicly nonequivalent 
seatings) is N — n\/n = (n — 1)! 

On the other hand, suppose we have a small round table with three positions for 
chairs (each 120° apart), and white and black chairs are available. There are 2 3 = 8 
ways to place a white or black chair in each position. See Figure 9.5. There are three 
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Multiplicity 3 111 



Figure 9.5 Multiplicity 1113 

cyclic rotations of the table possible, 0° , 1 20° , and 240° . We have c — 8 “colorings” and 
s = 3 symmetries, but the number of equivalence classes cannot be N = |, a fraction! 

It is true that the three arrangements of one black and two white chairs (or 
vice versa) form an equivalence class, since 0°, 120°, and 240° rotations move the 
one black chair to different positions. However, an arrangement of three black chairs 
(or three white chairs) forms an equivalence class by itself. See Figure 9.5. Any 
rotation maps this arrangement of three black chairs into itself, that is, leaves it fixed. 

We need to correct the numerator in the formula N — c/s by adding the multiplic- 
ities of an arrangement — that is, when one or more symmetries map the arrangement 
to itself instead of to other arrangements. In this way, when multiplicities are counted, 
every equivalence class will have 5 members. Since two symmetries, along with the 0° 
symmetry, leave the all-black-chair arrangement fixed and similarly for the all-white, 
then counting arrangements in Figure 9.5 with multiplicities we have the correct 
answer 

Af = (3+1 + 1 + 1 + 1 + 1 + 1+ 3)/3 = 12/3 = 4 

The “multiplicity” correction is even more complicated for 2-colorings of our 
square. Here the size of an equivalence class of colorings may be 1 or 2 or 4, but never 
8 (= the number of symmetries of the square). The first problem is that several its, 
besides the identity symmetry 7Ti , may leave a coloring C, fixed — that is, ir(C'i) = Cj . 
The other problem is that if Q is another coloring in C ( - ’s equivalence class, there may 
be several its all taking C, to Q. For example, the coloring C\q (see Figure 9.1) is 
fixed by symmetries it\, it-}, iti, its, and is mapped to Cn by symmetries m, it a, its, 
it (, . 

By the lemma in Section 9.1, the symmetries it 2 , it 4 , its, it(, taking C io to Cn 
can be written in the form it = it 2 - it ' , where it' is a symmetry that leaves Cu fixed, 
or else it 2 followed by it' would not take C io to Cn . For example, 

( a b c d\ _ f a b c d\ fabcd\ 
bade) \bcda) \cbad ) 

Similarly 7r2 = it 2 ■ it\ , itA — it 2 ■ its, He = it 2 ■ iti- Conversely, given any it* that 
leaves Cn fixed, it 2 ■ it* takes Cio to Cn and so it 2 ■ it* must be one of it 2 , it a, its, it6- 
Thus there is a 1 — 1 correspondence between the its that take Cio to Cn and the 7rs 
that leave Cn fixed. Therefore, to count the colorings in an equivalence class E with 
appropriate multiplicities (i.e., coloring Cn has multiplicity 4 since four different 7rs 
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take Cio to Cn), it suffices to sum over the colorings in E the number of ns that leave 
each coloring fixed. 

In the case of the equivalence class consisting of C 10 and Cn, each of C 10 and Cn 
have multiplicity 4, so that the size of their equivalence class including multiplicities 
is 4 + 4 = 8 (= s, the number of symmetries), as required. 

In general, when multiplicities are counted, each equivalence class E will have 
5 elements. If <p(x) denotes the number of jrs that leave the coloring x fixed, then 

E.v 6 £ = 5 - 

Formal proof that ^, xeE 4>(x) = s: Let x\,X 2 ,...,x m be the colorings in 
equivalence class E and let tt\ . txi, . . . , n s be the group of symmetries. These 7rs 
can be divided into m groups, R \ . If, . . . , R m , where Rj is the set of 7rs that map 
x i to Xj. As shown above for x\ = C io and x, = Cn, the number of jrs mapping xi 
to Xi equals <p(xj), the number of jrs that leave X; fixed. Thus, Eve£ <A(x) sums the 
number of 7rs in R\, in Ri, . . . , and in R m . But the sum of the Rs is just the total 
number of symmetries, s. 

Summing over all equivalence classes, we obtain the following theorem, first 
proved by Burnside more than 100 years ago. 

Theorem (Burnside, 1897) 

Let G be a group of permutations of the set S (corners of a square). Let T be any 
collection of colorings of S (2-colorings of the corners) that is closed under G. Then 
the number N of equivalence classes is 



or 



(*) 


where |G| is the number of permutations and 'F(jr) is the number of colorings in T 
left fixed by tx . 

By “closed under G,” we mean that for all tx e G and x e T . jr(x) e T . This 
closure property is automatic when T is the set of all corner 2-colorings of the square. 
In Section 9.4 we need to apply formula (*) to special subsets of S , such as the set of 
colorings with two corners black and two corners white, that are closed under G. 

The two sums in the theorem both count all instances of some coloring being 
left fixed by some tx, the first sums over the different colorings, the second sums over 
different 7rs. Formula (*) will turn out to be more useful in later computations. 

We informally summarize the spirit behind formula (*) as follows. The total 
number c of all colorings of our square is equal to ^(jrG, since the identity symmetry 
jti leaves all colorings fixed. If each of the eight symmetries mapped a coloring C into 
eight different colorings, then we would have eight colorings in each equivalence class 
and hence a total c/8 = T (jr i )/ 1 G equivalence classes. However, for any coloring 
C, the colorings in the collection of jr(C)s as tx takes on all the different symmetries 
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Figure 9.6 

in G are never distinct. This means that each equivalence class does not contain 8 
elements. The terms ^(tt,) in (*) add the “multiplicities” of repeated colorings so that 
each equivalence class has eight colorings in the sum in (*). 

Example 1: 2-Colored Batons 

A baton is painted with equal-sized cylindrical bands. Each band can be painted 
black or white. If the baton is unoriented as when spun in the air, how many different 
2-colorings of the baton are possible if the baton has (a) 2 bands? (b) 3 bands? 
(c) 4 bands? 

The batons with two bands, three bands, and four bands are pictured in Figure 9.6. 
Irrespective of the number of bands, there are two symmetries of a baton: tx\ is a 0° 
revolution of the baton — n\ is the identity symmetry — and 1 x 2 is a 1 80° revolution of 
the baton. 

(a) For the 2-band baton, the set of 2-colorings left fixed by tc\ is all 2-colorings 
of the baton. There are 2 2 = 4 2-colorings, and so VF (tv t ) = 4. The set of 2-colorings 
left fixed by 1 x 2 consists of the all-black and all-white coloring, and so T (jr 2 ) = 2. 
By Burnside’s theorem, the number of different colorings is \ ['F(tti) + (tt 2 J] = 
1(4 + 2) = 3. 

(b) For the 3-band baton, all 2 ' 2-colorings are left fixed by Jt \ , and so '+ ( tt 1 ) = 

2 3 = 8. The set of 2-colorings left fixed by 7 x 2 can have any color in the middle band 

(band 2) and a common color in the two end bands, and so +(+2) = 2x2 = 4. The 
number of different colorings is ^ ) + v k(n - 2)] = ^(8 + 4) = 6. 

(c) For the 4-band baton, all 2 4 2-colorings are left fixed by JT| , and so '+ f tt 1 ) = 

2 4 = 16. The set of 2-colorings left fixed by 7 x 2 have a common color for the end 
bands and a common color for the inner bands, so +(712) = 2x2 = 4. The number 
of different colorings is | ['F(tti) + rF^)] = ^(16 + 4) = 10. ■ 

Example 2: 3-Colored Batons 

How many different 3-colorings of the bands of an n-band baton are there if the baton 
is unoriented (as in Example 1)? 

As in Example 1, the symmetries of the baton are a 0° revolution and a 180° 
revolution. We apply formula (*) for the group of the 0° and 180° revolution acting 
on an n-band baton with three colors. There are 3" colorings of the fixed baton and so 
+(()") = 3". The number of colorings left fixed by a 180° spin depends on whether n 
is even or odd. 

If n is even, each of the n /2 bands on one half of the baton can be any color — 3"^ 2 
choices — and then for the coloring to be fixed by a 1 80° spin, each of the symmetrically 
opposite bands must be the corresponding color. So *F( 1 80° ) = 3" ,/2 and we have from 
formula (*): N = ±(3 n + 3 n/2 ). 


1 

2 

3 

4 


(a) 


(b) 


(c) 
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To enumerate batons left fixed by a 180° spin when n is odd, we can use any 
color for the “odd” band in the middle of the baton — three choices. Each of the 
(n — l)/2 bands on one side of the middle band can be any color — 3 f "' 1 1/2 choices — 
and again the other (n — l)/2 bands must be colored symmetrically. So fl/((] 80") = 
3 x 3 (,, - 1)/2 = 3<«+b/2 and N = ±(3 n + 3 (,!+1)/2 ). ■ 


Example 3: 3-Colored Necklaces 

Suppose a necklace can be made from beads of three colors — black, white, and red. 
How many different necklaces with n beads are there? 

When the n beads are positioned symmetrically about the circle, the beads occupy 
the positions of the corners of a regular n-gon. Thus, our question asks for the number 
of corner colorings of an n-gon using three colors. The answer depends on what is 
meant by “different.” If the beads are not allowed to move about the necklace — that 
is, the n-gon is fixed — the answer is 3 x 3 x • • • x 3 = 3" (three color choices at each 
of n corners). A more realistic interpretation of our problem would allow the beads 
to move freely about the circle — that is, the n-gon rotates freely (but in this case flips 
will not be allowed). We employ formula (*) to count the number N of equivalence 
classes of these 3-colorings induced by the rotational symmetries of an n-gon. We 
shall now do the calculations for n — 3. A more general technique for larger n is 
developed in the next section. 

There are 3 * 1 2 3 = 27 3-colorings of a 3-bead necklace, and three rotations of 0°, 
120°, 240°. The 0° rotation leaves all colorings fixed, and so T(0°) = 27. The 120° 
rotation cannot fix colorings in which some color occurs at only one corner. It follows 
that the 120° rotation fixes just the monochromatic colorings. Thus, T ( 1 20° ) = 3. The 
240° rotation is a reverse 120° rotation, and so Vk(240°) = 3. By formula (*), we have 

N= ^(27 + 3 + 3)= 11 ■ 



9.2 EXERCISES 


Summary of Exercises The exercises continue the application of 
Burnside’s theorem to count colorings introduced in Examples 1, 2, and 3. 


1. How many different n-bead necklaces are there using beads of red, white, blue, 
and green (assume necklaces can rotate but cannot flip over)? 

(a) n = 3 (b) n = 4 

2. How many different ways are there to place a diamond, sapphire, or ruby at each 

of the four vertices of this pin? 
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3. Fifteen balls are put in a triangular array as shown. How many different arrays 
can be made using balls of three colors if the array is free to rotate? 



4. How many different ways are there to 2-color the 64 squares of an 8 x 8 chess- 
board that rotates freely? 

5. A merry-go-round can be built with three different styles of horses. How many 
five-horse merry-go-rounds are there? 

6. A domino is a thin rectangular piece of wood with two adjacent squares on one 
side (the other side is black). Each square is either blank or has 1, 2, 3, 4, 5, or 6 
dots. 

(a) How many different dominoes are there? 

(b) Check your answer by modeling a domino as an (unordered) subset of two 
numbers chosen with repetition for 0, 1,2, 3, 4, 5, 6. 

7. Two /7-digit decimal sequences consisting of digits 0, 1, 6, 8, 9 are vertically 
equivalent if reading one upside-down produces the other — that is, 0068 and 
8900. How many different (vertically inequivalent) /7-digit (0, 1 , 6, 8, 9) sequences 
are there? 

8. How many different ways are there to color the five faces of an unoriented pyramid 
(with a square base) using red, white, blue, and yellow? 

9. (a) Find the group of all possible permutations of three objects. 

(b) Find the number of ways to distribute 12 identical balls in three indistin- 
guishable boxes. [Hint: First let boxes be distinct, and then use part (a).] 

10. How many ways are there to 3-color the n bands of a baton if adjacent bands 
must have different colors? 

11. How many ways are there to 3-color the corners of a square with rotations and 
reflections allowed if adjacent corners must have different colors? 

12. (a) How many ways are there to distribute 12 red jelly beans to four children, a 

pair of identical female twins and a pair of identical male twins? 

(b) Repeat part (a) with the requirement that each child must have at least one 
jelly bean. 

13. (a) Show that for any given /, the subset G, of symmetries of the square that leave 


coloring C, (of Figure 9.1) fixed is a group (a subgroup of all symmetries). 

(b) Find G 4 . 

(c) Find Gy. 
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9.3 THE CYCLE INDEX 

Without further theory, we would find it very difficult to apply Burnside’s theorem to 
counting different colorings of an unoriented figure. Recall that Burnside’s theorem 
in Section 9.2 says that the number N of equivalence classes in a set T of colorings of 
a figure with respect to a group of symmetries G is 

1 \ - 

N — > 'h(Tr) 

IGI ko 

where 'F(n) is the number of colorings in T left fixed by n . If the set T were all 
3-colorings of the corners of a 10-gon or a cube, it would seem close to impossible 
to determine 'f / (7r)s, the number of colorings left fixed by various symmetries it of 
the figure. However, we shall show that vj/ (jr ) can be determined easily from the 
structure of it. We develop the theory for this simplified calculation of (jr ) in terms 
of 2-colorings of a square. 

Let us apply Burnside’s theorem to the 2-colorings of the square. Initially we 
determine the number of 2-colorings left fixed by motion 7 r,- by inspection (using 
Figures 9.1 and 9.2). As we count ^(tt,) for each tt,, we look for a pattern that would 
enable us to predict mathematically which colorings must be left fixed by each ;r; . 
It is helpful to make a table of the jt,-s and the colorings that they leave fixed; see 
columns (i) and (ii) in Figure 9.7 [column (iii) is developed later]. 

The 0° rotation tc\ leaves each corner fixed and hence it leaves all colorings fixed. 
So = 16. The 90° rotation Tt 2 cyclicly permutes corners a, b, c, d. Being left 
fixed by the 90° rotation means that each corner in a coloring has the same color after 
the 90° rotation as it did before. Since tti takes a to /;, then a coloring left fixed by Jt 2 
must have the same color at a as at b. Similarly, such a coloring must have the same 
color at b as at c, the same color at c as at d, and the same at d as at a. Taken together, 
these conditions imply that only the colorings of all white or all black corners, C\ 


(0 

Motion 

(ii) 

Colorings Left 

Fixed by ttj 

(iii) 

Cycle Structure 
Representation 

7 Ti 

16 — all colorings 

4 

7*2 

2 — Ci , Ci6 

X4 

7 T 3 

2 — Ci, C10, Cn, Ci6 

x\ 

7?4 

2 — Ci, Ci6 

x 4 

7 T 5 

2 — Ci, Cg, Cg, Ci6 

x\ 

776 

2 — Ci, C7, C9C16 

x\ 

7 T 7 

8 — Ci, C2, C4, C10 

Cll, C12, C14, Ci6 

X1X2 

7*8 

8 — Ci, C 3 , C5, C10 

Cn, C13, C15, Ci6 

xjx 2 


Figure 9.7 
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and C i 6 , are left fixed. Thus T( 7 r 2 ) = 2. In general, a coloring C will be left fixed by 
tt if and only if for each corner v, the color of C at v is the same as the color at tx(v) 
so that the symmetry leaves the color at tx(v) unchanged. 

Next we consider the 180° rotation n 3 . Looking at the depiction of 713 in 
Figure 9.2, we see that 773 causes corners a and c to interchange and corners b and d to 
interchange. It follows that a coloring left fixed by 773 must have the same color at cor- 
ners a and c and the same color at b and d (no further conditions are needed). With two 
color choices for a, c and with two color choices for b, d, we can construct 2x2 = 4 
colorings that will be left fixed — namely, C 1 , C 10 , C n, C 16 . Hence ( tts ) = 4. 

Symmetry 714 is a rotation of 270° or —90°, and so the symmetry is similar to 7 r 2 . 
Hence — ^( 7 x 2 ) = 2. The horizontal rotation 7 x 5 interchanges corners a, b and 

interchanges corners c, d. Like the 1 80° rotation 7 x 3 , symmetry 7 X 5 will leave a coloring 
fixed if corners a and b have the same color — two choices, white or black — and corners 
c and d have the same color — two choices. Then like 773 , we have TIttV) = 2x 2 = 4 
colorings fixed, namely, C\, C^, Cg, Ci 6 - 

A pattern is becoming clear. The reader should now be able quickly to pre- 
dict that 7T6 will also leave 2x2 = 4 2-colorings of the square fixed, for again this 
symmetry interchanges two pairs of corners. Formally, an interchange is a cyclic per- 
mutation on two elements. All our enumeration of fixed colorings has been based on 
the fact that if 7T; cyclicly permutes a subset of corners ( that is, the corners form a 
cycle ofxXi), then those corners must all be the same color in any coloring left fixed 
by 7 Xj. 

As mentioned in Section 9.1, any tt, can be represented as a product of disjoint 
cycles. For example, txt, = ( ac)(bd ) and 7 x 4 = ( adcb ). For each symmetry, we need to 
get such a cyclic representation and then count the number of ways to assign a color 
to each cycle of corners. For future use, let us also classify the cycles by their length. 
It will prove convenient to encode a symmetry’s cycle information in the form of a 
product containing one 44 for each cycle of 1 corner in 7r, , one x 2 for each cycle of 
size 2, and so forth. This expression is called the cycle structure representation of a 
symmetry. 

The cycle structure representation of 7 x 2 and tt 4 is x 4 , since each consists of one 
4-cycle: 7 x 2 = (abed) and 7 r 4 = (adcb). The cycle structure representation for 713 , jx<,, 
and 7 T 6 is x 2 .r 2 or x\, since each consists of two 2-cycles. Column (iii) in Figure 9.7 
gives the cycle structure representation of each symmetry. 

What about tti? Previously, it sufficed to say that 7 Ti leaves all colorings fixed. 
Now it is time to point out that a corner left fixed by a permutation is classified as a 
1-cycle. Thus 7 ti consists of four 1-cycles. Its cycle structure representation is then xxf . 
We “predict” that 7 Ti leaves 2 4 =16 colorings fixed — that is, 7 T] leaves all 2-colorings 
fixed. 

For any symmetry tx of any figure, the number of colorings left fixed will be given 
by setting each xj equal to 2 (or, in general, the number of colors available) in the 
cycle structure represen tation of tx, that is, 


^ ^ ^ 2 number c y cles i n n 
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Let us apply this theory to and tt s . For each of these reflections, the cycle structure 
representation is seen to be x\x 2 (follow the arrows in Figure 9.2), and thus for each 
we can find 2 2 x 2 = 8 colorings that are left fixed. Note that for 180° flips around 
opposite corners and midpoints of opposite edges, all comers will be in cycles of size 
2, unless a corner is left fixed by the flip. 

To obtain the number of different 2-colorings of the floating square with 
Burnside’s Theorem, we sum the numbers in column (ii) of Figure 9.7 and divide 
by 8: 

7^77 V '*(n) = 7(16 + 2 + 4 + 2 + 4 + 4+8 + 8)= 7 (48) = 6 

neG 8 8 

There is a slightly simpler way to get this result. First, algebraically sum the cycle 
stmeture representations of each symmetry, collecting like terms together, and then 
divide by 8. From column (iii) of Figure 9.7, we obtain g(Xj + 2x4 + 3x\ + 2x 2 xi). 
This expression is called the cycle index Pc(x 1, X 2 , ■ ■ . , x&) for a group G of sym- 
metries. By setting each x,- = 2 in this cycle index — that is, /+(2, 2, . . . , 2) — we get 
the same answer. (Before, the steps were reversed: we first set x,- = 2 in each cycle 
structure representation and then added.) 

Suppose that instead of two colors, we had three colors. Then the same reasoning 
applies, but now there are three choices for the color of the corners in each cycle. If 
a symmetry has k cycles, then it will leave 3* 3-colorings of the square fixed, and the 
number of different 3-colorings will be Pq( 3,3, ... ,3). More generally, for any m. 
Pc (m, m , . . . , m) will be the number of nonequivalent /72-colorings of an unoriented 
square. The argument used to derive this coloring counting formula with the cycle 
index of a square is valid for colorings of any set with associated symmetries. 

Theorem 

Let S be a nonempty set of elements and G be a group of symmetries of S that acts 
to induce an equivalence relation on the set of //7-colorings of S. Then the number of 
nonequivalent ///-colorings of S is given by Pc(m, ///, . . . , ///). 

Example 1: Coloring Necklaces 

Use this theorem to re-solve the problem of Example 3 in Section 9.2 of counting 
//-bead necklaces with black, white, and red beads. 

Recall that beads on the necklace can rotate freely around the circle, but that 
reflections are not permitted, and that the number of different 3-colored strings of n 
beads is equal to the number of 3-colorings of a cyclicly unoriented //- gon. For n — 3, 
the rotations are of 0°, 120°, and 240° with cycle structure representations of x 3 , X3, 
and X3, respectively. Thus, Pq = |(x 3 + 2x3). The number of 3-colored strings of 
three beads is Pc (3, 3, 3) = |(3 3 + 2 x 3) = 11. More generally, the number of m- 
colored necklaces of three beads is Pc(m , ///, m) = |(m 3 + 2m). 

Let us try a more complicated case: // = 8 (see Figure 9.8). The rotations are 
of 0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315°. The 0° rotation consists of eight 
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1-cycles. The 45° rotation is the cyclic permutation ( abcdefgh ). The 90° rotation has 
the cycle decomposition ( aceg ) ( bdfh ). The 135° rotation is the cyclic permutation 
(adgbehcf). The 180° rotation has the cyclic decomposition (ae) ( bf ) (eg) (dh). The 
cycle structure representations are thus 0° rotation, x 8 , 45° rotation, x%, 90° rotation, 
x|; 135° rotation, xg; and 180° rotation, x\. The 225°, 270°, and 315° rotations are 
reverse rotations of 135°, 90°, 45°, respectively, and have the corresponding cycle 
structure representations. Collecting terms, we obtain 


Pc = \ (*i + 4xg + 2x4 + x 4 ) 

O 

The number of different m-colored necklaces of eight beads is 

-(m 8 + 4 m + 2 m 2 + m 4 ) 

8 


For m = 3, we have 

-(3 8 + 4 x 3 + 2 x 3 2 + 3 4 ) = -(6561 + 12+ 18 + 81) = 834 ■ 

8 8 

There are two helpful observations about rotations of an n-gon. First, for a given 
rotation, all its cycles will be of the same size, since it does not matter at which corner 
you start the rotation’s cyclic permutation of the corners. It follows that the size of any 
rotation’s cycles must divide 11 , the number of corners. This tells us what the possible 
sizes of cycles can be for a rotation. For example, for a 9-gon, the only possible cycle 
size for any rotation is 1, 3, or 9 (the divisors of 9). 

Second, for any rotation, the cycle size will be the minimum number of times we 
must iterate this rotation to generate a cumulative rotation that is a multiple of 360°, 
since to map a corner back to its original position (completing a cycle), a rotation 
must move a corner a multiple of 360°. If a rotation of an n-gon moves each corner k 
places clockwise around the n- gon, then the length of the rotation’s cycle will be the 
minimum times we must iterate this rotation so that the repeated shifts of k places 
equal a multiple of n. In particular, if k has no common divisors with n, the rotation 
will form a single cycle of length n. For example, the 135° degree rotation of the 8-gon 
in Example 1 moved each corner three places clockwise. Since the smallest number 
of 3s that sum to a multiple of 8 is eight 3s (3 x 8 = 24), the 135° rotation creates a 
single cycle of length 8. 
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Example 2: Coloring Corners of a Tetrahedron 

Use the theorem to determine the number of 3-colorings of the four corners of a 
floating tetrahedron. In Example 4 in Section 9.1 we listed the 12 symmetries of the 
tetrahedron (see Figure 9.4 in Section 9.1): the 0° revolution, the eight revolutions 
of 120° and 240° about a corner and the middle of the opposite face, and the three 
revolutions of 180° about the middle of opposite edges. 

The 0° revolution has the cycle structure representation x 4 . The 120° revolution 
about corner a and the middle of face bed has the cyclic decomposition ( a)(bcd ) 
and its cycle structure representation is X 1 X 3 . By symmetry, the other 120° and 240° 
revolutions have this same cycle structure representation. The 180° revolution about 
the middle of edges ab and cd has the cyclic decomposition (ab)(cd ) and its cycle 
structure representation is xf. By symmetry, the other 180° revolutions have the same 
cycle structure representation. Thus we have 

Pg = - ;(4 + 8 x 1 x 3 + 3xf) 

The number of different corner 3-colorings is 

P G ( 3, 3, 3, 3) = ^(3 4 + 8x3x3 + 3x 3 * 1 2 3 ) 

1 

= —(81 +72 + 27)= 15 ■ 



9.3 EXERCISES 


Summary of Exercises The first 1 1 exercises count distinct color- 
ings of unoriented figures. The remaining problems involve associated theory. Note 
that “floating” means that all possible rotations and reflection are allowed. 


1. How many ways are there to color the corners of a floating square using four 
different colors? 

2. How many ways are there to 4-color the corners of a pentagon that is 

(a) Distinct with respect to rotations only? 

(b) Distinct with respect to rotations and reflections? 

3. How many ways are there to 3-color the corners of a hexagon that is 

(a) Distinct with respect to rotations only? 

(b) Distinct with respect to rotations and reflections? 

(c) Find two 3 -colorings of the hexagon that are different in part (a) but equivalent 
in part (b). 
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4 . How many different /(-bead necklaces (cyclicly distinct) can be made from three 
colors of beads when 

(a) n = 7 (b) n = 9 (c) n = 10 (d)n = ll 


5. Find the number of different ///-colorings of the vertices of the following floating 
figures. 


(a) 


a 



a b c d 


h g f « 


(b) 


a b 


i f 




\ /: 


6. (a) Find the cycle index for the group of symmetries of a square in terms of 
permutations of edges, not corners. 

(b) How many ways are there to 3-color the edges of a floating square? 

(c) How many ways are there to 3-color both edges and comers of a floating square? 

(d) Why is the following not the case? (number of floating 3-colorings of corners) 
(number of floating 3-colorings of edges) = (number of floating 3-colorings 
of edges and corners) Explain. 


7. How many ways are there to ///-color the edges of the floating figures in Exercise 
5? {Hint: The cycle index now is for symmetries of the edges.) 

8. (a) Find the number of different //-bead 3-colored necklaces (cyclicly distinct ) in 

which each color appears at least once when (i) n = 3, (ii) n — 4, (iii) n —1 . 
(b) Repeat part (a) when necklaces may reflect as well as rotate. 

9. Find the number of different 2-sided dominoes (two squares of 1 to 6 dots or a 
blank on each side of the domino). 

10. (a) Let G be the group of all 4! permutations of 1, 2, 3, 4. Find Pc. 

(b) U se part (a) to find the number of ways to paint four identical marbles each one 
of three colors (check your answer by modeling this problem as a selection- 
with-repetition problem). 

(c) Use part (a) to find the number of ways to put 12 balls chosen from three 
colors into four indistinguishable boxes with three balls in each box. 
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11. (a) Find the number of 2 x 4 chessboards distinct under rotation whose squares 

are colored red or black. 

(b) Suppose that two chessboards are also considered equivalent (aside from 
rotational symmetry) if one can be obtained from the other by complementing 
red and black colors. How many different 2x4 chessboards are there? 

12 . Show that if x\' ■ ■ ■ x^ 1 is the cycle index for a symmetry n of an n- gon 

(expressed in terms of a permutation of the corners), then lAq + 2ki H — • + 
mk m = n. 

13 . In solving for the number of corner 2-colorings of some unoriented figure, sup- 
pose we are given the cycle index Pc* of the group G* of induced permutations 
of the 2-colorings [as in (1) in Section 9.11, instead of the usual cycle index P G of 
the group of symmetries of the figure. What integer values should be substituted 
now for each x, in P G * to get the number of 2-colorings, or will no substitution 
work? Explain. 

14 . Let S be some set of n objects and G a group of permutations of S. For subsets 
S i , S 2 of S, define the equivalence relation S\ ~ .S'i if for some tt <= G . .S) = tv IS 2 ) 
(= \tt(s) I .v e .S’ 2 } ) - Show that the number of equivalence classes equals 
Pc( 2, 2, . . .). (There are 2" subsets of S in all.) 

15 . Find the number of m-colorings of the corners of a p- gon, where p is a prime 
(P > 2), if 


(a) Only rotations are allowed 

(b) Rotations and reflections are allowed 



9.4 POLYA’S FORMULA 

We are now ready to address our ultimate goal of a formula for the pattern inven- 
tory. Recall that the pattern inventory is a generating function that tells how many 
colorings of an unoriented figure there are using different possible collections of 
colors. For black-white colorings of the unoriented square, the pattern inventory is 
lb 4 + Ib^w + 2 b 2 w 2 + 1 bw 3 + 1 w 4 . For example, the term lb 3 w tells us that there is 
one nonequivalent coloring with three black (b) corners and one white (w) corner. The 
coefficients in the pattern inventory can be viewed as the results of several Burnside 
theorem-type counting problems. Recall that Burnside’s theorem says that the num- 
ber N of equivalence classes in T, a collection of colorings of S (where S is the corners 
of a square or the faces of a tetrahedron, etc.), caused by a group of symmetries of S is 

1 \ - 

N= > V(7T) (*) 

\G\ta 
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In the case of 2-colorings of the floating square, we divide the set of all colorings 
in Figure 9. 1 into sets based on the numbers of black and white corners: 

To = {C,} 

7, = {C 2 , C 3 , C 4 , C 5 } 

Ti = [Co, Ci, Cs, Cg, Cio, Cn} 

*3 = {Cn, C13, C\ 4 , C15} 

Ta = {C 16 } 

The coefficient of bfv can be obtained from (*) if we let the group G of symmetries 
of the square act on just the set 7j. Recall that formula (*) from Burnside’s theorem 
applies to any collection of colorings that is closed under G. The term “closed under 
G” means that for any it e G, and any coloring C e T, ir(C) equals another coloring 
in T. 7j is closed under G, since any symmetry acting on a coloring with three black 
corners and one white corner yields another coloring with three black corners and 
one white corner. The same is true for the other 7) ( s. Thus, the coefficient of b 4 ~ k w k 
in the pattern inventory is the result off*) when T \ is the set on which G acts. 

Let us try to solve these five subproblems simultaneously. In Figure 9.9 we have 
duplicated the table in Figure 9.7 and added a new column (iv). In the first row of 
column (iv), we write a polynomial whose coefficients give the numbers of 2-colorings 
in each T k left fixed by it\ , then in the second row of the table we write a polynomial 
for the numbers of 2-colorings in each T) left fixed by it 2 , then by 713, and so forth. 
Then we total up the b 4 term in each row (the number of 2-colorings with four blacks) 
and divide by 8 to get the coefficient of b 4 in the pattern inventory, total up the b f v 
term in each row and divide by 8 to get the coefficient of ffiw, and so forth. 

Since the action of it\ leaves all C s fixed, the first row’s coefficients are 1, 4, 6, 
4, 1 . We write b 4 + Ab 2 w + 6 b 2 w 2 + 4 bw 3 + w 4 ; this is an inventory of fixed colorings. 
For 7Ti , the inventory of fixed colorings is an inventory of all colorings. Observe that 
this inventory is simply ( b + w) 4 = (b + w)(b + w){b + w){b + w), one ( b + w) for 
each corner. For jt 2 , the inventory is b 4 + w 4 . For it?,, we find by observation that the 
inventory is b 4 + 2 b 2 w 2 + w 4 . This expression factors into ( b 2 + w 2 ) 2 . 


(0 

Motion 
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(*'*') 
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7*1 

1 6 — all colorings 

y 4 

X 1 

0 > + w) 4 

= \b 4 + Ab 2 w+6b 2 w 2 + Abw 2 

+ 1 w 4 

7*2 

2 — Ci, Ci6 

X 4 

(b 4 + w 4 ) 

= lb 4 


+ lw 4 

7*3 

2 — Ci, Cio, Cn, Ci6 

X 2 

X 2 

(b 2 + VF 2 ) 2 

= lb 4 

+ lb 2 w 2 

+ lw 4 

7 T 4 

2 — Ci, Ci6 

X4 

(. b 4 + w 4 ) 

= lb 4 


+ lw 4 

7*5 

2 — Ci, Cs, , Ci6 

X 2 

x 2 

( b 2 + w 2 ) 2 

= lb 4 

+ 2 b 2 w 2 

+ lw 4 

7*6 

2 — Ci, C7, C9, C16 

X 2 

X 2 

( b 2 + w 2 ) 2 

= lb 4 

+ lb 2 w 2 

+ lw 4 

7*7 

8 — Ci, C2, C4, Cio 
Cn, C12, C14, C16 

x\x 2 

(b + w) 2 (b 2 + iv 2 ) 

= lb 4 + 2 b 2 w 

+ 2b 2 w 2 + 2 bw 1 

+ lw 4 

7*8 

8 — Ci, C3, C5, Cio 
Cn, C13, C15, C16 

x\xi 

{b + w) 2 (b 2 +W 2 ) 

= lb 4 + 2 fc 3 w 

+ 2 trw 2 + 2 bw 3 

+ lw 4 


Figure 9.9 
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Just as we did before when counting the total number of colorings fixed by the 
action of some n, let us look for a pattern in the inventories of fixed colorings. Again 
the key to the pattern is the fact that in a coloring left fixed by it, all corners in a cycle 
of 7T must have the same color. Since 1 x 2 has one cycle involving all four corners, 
the possibilities are thus all corners black or all corners white; hence the inventory is 
b 4 + w 4 . The motion 7 T 3 has two 2-cycles ( ac ) and (bd). Each 2-cycle uses two blacks 
or two whites in a fixed coloring. Hence the inventory of a cycle of size two is b 2 + w 2 . 
The possibilities with two such cycles have the inventory ( b 2 + w 2 )(b 2 + w 2 ). 

The inventory of fixed colorings for tx ,• will be a product of factors (b J + w J ), one 
factor for each /'-cycle of the 7 r,- . So we need to know the number of cycles in tt, of each 
size. But this is exactly the information encoded in the cycle structure representation. 
Indeed, setting Xj = (b J + w- 7 ) in the representation yields precisely the inventory of 
fixed colorings for 7T, . By this method we compute the rest of the inventories of fixed 
colorings. See Figure 9.9. For jr 7 especially, the inventory should be checked against 
the list of colorings in column (ii). The pattern inventory is obtained by adding together 
the inventories of fixed colorings, collecting like-power terms, and dividing by 8. 

As before in Section 9.3, we get a more compact formula and save some com- 
putation by first adding together the cycle structure representations and dividing by 
8, and then setting each xj = ( b ■> + w- 7 ) and doing the polynomial algebra all at once. 
Again the first step in this approach yields the cycle index Pc(xi, vo, . . . , Xk). Thus 
by setting xj = (b J + w J ) in Pq , we obtain the pattern inventory. 

If three colors, black, white, and red, were permitted, each cycle of size j would 
have an inventory of (b- 7 + w 7 + r- 7 ) in a fixed coloring. So we would set xj = (V + 
w i _|_ r / j j n p G The preceding argument applies for any number of colors and any 
figure. In greater generality we have the following theorem. 

Theorem (Polya’s Enumeration Formula) 

Let S be a set of elements and G be a group of permutations of S that acts to induce an 
equivalence relation on the colorings of S. The inventory of nonequivalent colorings 
of S using two colors is given by the generating function Pc((b + w), (b 2 + w 2 ), 
( b 3 + w 3 ), ...,(&* + w k )). The inventory using colors Ci , C 2 , . . . , c m is 



For a moment, let us return to the problem of counting the total number of 
nonequivalent 2-colorings. This number is simply the sum of the coefficients in 
the pattern inventory. To sum coefficients, we set the indeterminants, b and w (and 
hence their powers), equal to 1 , or, equivalently, set xj = 2 in P (: . If m colors were 
allowed, we would set xj = m in P c , obtaining the same formula as in the theorem in 
Section 9.3. 

As in many other generating function problems, the actual expansion of the 
generating function for a pattern inventory can be quite tedious. We are expanding 
expressions of the form (c\ + c l 2 — ■ + C' m ) r . When m and r get large, it is time to 
turn to computer algebra software. 
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Example 1: Pattern Inventory for 3-Bead Necklaces 

Determine the pattern inventory for 3-bead necklaces distinct under rotations using 
black and white beads. Repeat using black, white, and red beads. 

From Example 1 of Section 9.3, we know Pq = |(x 3 + 2x3). Substituting xj = 
(b J + w- 7 ), we get 

^[(b + w) 3 + 2 (b 3 + w 3 )] = ' t(^ 3 + 3b 2 w + 3bw 2 + w 3 ) + (2 b 3 + 2 w 3 )] 

= -(3b 3 + 3b 2 w + 3bw 2 + 3 w 3 ) 

3 

= b 3 + b 2 w + bw 2 + w 3 

This result could be obtained empirically. There is only one way to color all beads 
black or all white. If one bead is white and the others black, then by rotation the white 
bead can occur anywhere; thus there is only one necklace with one white and two 
blacks. The same is true by symmetry for a necklace with one black and two whites. 

Now consider 3-bead necklaces using three colors. We substitute xj = (b J + 
w i _|_ r i j n p G t 0 obtain | [(b + w + r) 3 + 2 (b 3 + w 3 + r 3 )]. Instead of expanding the 
polynomials in this expression, we use indirect means. Perhaps again each inventory 
term has coefficient 1 . There is a general test for whether all inventory coefficients are 
1 : Compare N*, the number of terms in the pattern inventory, with N, the total number 
of patterns [i.e., P (: ( m . m , , m)\. Since /V equals the sum of the coefficients in the 
inventory, then N * = N if and only if each term has coefficient 1 . The number N* of 
terms in the pattern inventory when n elements (corner, beads, etc.) are colored with 
m colors is C(n + m — 1, n) (see Exercise 19). For the case at hand, m = 3. n — 3, 
and so N* = ( 3+3_1 ) = 10. From Example 1 of Section 9.3, we know that IV = 11. 
Since N* / N, we know that all terms do not have coefficient 1. 

On the other hand, the only way for IV* = N + 1 is that nine terms in the inventory 
must have coefficient 1 and one term coefficient 2. But as argued above, there is only 
one necklace with all beads the same color and only one necklace with one bead of 
color A and the other two beads of color B. The only other possibility for a 3-bead 
necklace using three colors is to have one bead of each color. Thus, there must be 
two necklaces with one bead of each color (the necklaces are any cyclic order of the 
three colors and the reverse cyclic order), and the pattern inventory for black, white 
and red 3-bead necklaces is 

b 3 + w 3 + r 3 + b 2 w + b 2 r + w 2 b + w 2 r + r 2 b + r 2 w + 2 bwr m 


Example 2: Pattern Inventory for 7-Bead Necklaces 

Find the number of 7-bead necklaces distinct under rotations using three black and 
four white beads. 

We need to determine the coefficient of b 3 vv 4 in the pattern inventory. Each 
rotation, except the 0° rotation, is a cyclic permutation when the number of beads is 
a prime [see Exercise 13(a)], so Pc = \(x[ + 6x7). The pattern inventory is l[{b + 
w) 7 + 6(b 7 + w 7 )]. 
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Since the factor 6 (b 1 + w 1 ) in the pattern inventory contributes nothing to the if w 4 
term, we can neglect it. Thus the number of 3-black, 4-white necklaces is simply 

— [(coefficient of b 3 w 4 in ( b + w) 1 } = — ^ 

Example 3: Pattern Inventory for Edge 2-Colorings of a Tetrahedron 

Find the pattern inventory of black-white edge colorings of a tetrahedron. 

Although we calculated the cycle index for corner symmetries of the tetrahedron 
in Example 2 of Section 9.3, we need a different cycle index for edge symmetries. 
Since the set of objects to be colored is the six edges, we need to consider the 
symmetries of the tetrahedron as permutations of the edges. 

The 0° revolution clearly leaves all edges fixed and thus has cycle structure 
representation x®. The 120° (or 240°) revolution about a corner and the middle of 
the opposite face cyclicly permutes the edges incident to that corner and cyclicly 
permutes the edges bounding the opposite face (see Figure 9.4 in Section 9.1). Thus, 
the 120° revolution has cycle structure representation x\. 

The 180° revolution about opposite edges leaves those two edges fixed (see 
Figure 9.4). Since two applications of a 180° revolution return the tetrahedron to its 
original position, the other four edges not left fixed must be in 2-cycles. Thus, the 1 80° 
revolution has cycle structure representation x\x\ . Then Pq = -^(xj 1 + 8x| + 3x\x 2 ). 
Substituting x ; - = ( b j + w 7 ), we get 

^[(b + wf + 8 (b 2 + w 2 ) 2 + (b + w) 2 (b 2 + w 2 ) 2 ] 

= — [(b 6 + 6b 5 w + I5b 4 w 2 + 20b 3 w 3 + 15b 2 w 4 + 6bw 5 + w 6 ) + (8b 6 + I6b 3 w 3 
12 

+ 8w 6 ) + (3b 6 + 6 b 5 w + 9 b 4 w 2 + 12 Z? V + 9 b 2 w 4 + 6 bw 5 + 3w 6 )] 

= -^(12Z? 6 + I2b 5 w+ 24b 4 w 2 + 486V + 24 b 2 w 4 + 12 bw 5 + 12 w 6 ) 

— b 6 + b 5 w + 2b 4 w 2 + 4 b 3 w 3 + 2 b 2 w 4 + bw 5 + w 6 ■ 

Example 4: Pattern Inventory for Corner 2-Colorings of a Cube 

Find the pattern inventory for corner 2-colorings of a floating cube. 

The symmetries of the cube involve revolutions about opposite faces, about oppo- 
site edges, and about opposite corners. First, of course, there is the identity symmetry, 
with cycle structure representation x 3 . 

(a) Opposite faces: As a concrete example, we revolve about the center point of 
the pair of opposite faces abed and efgh; see Figure 9.10a. A 90° revolution 
yields the permutation ( abed) (efgh) with cycle structure representation x\. A 
270° revolution has the same structure. A 180° revolution yields the permutation 
(ac)(bd)(eg)(fh) with cycle structure representation x\. There are three pairs of 
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Figure 9.10 Revolutions of the cube, (a) Revolution about opposits faces, 
(b) Revolution about opposite edges, (c) Revolution about opposite corners. 


opposite faces and so the total contribution to the cycle index of opposite-face 
revolutions is 6xj + 3x\ . 

(b) Opposite edges: As a concrete example, we revolve about the middle of oppo- 
site edges ad and fg; see Figure 9.10ft. A 180° revolution yields the permuta- 
tion (ad)(bh)(ce)(fg) with cycle structure representation x\. There are six pairs 
of opposite edges, and so the total contribution of opposite edge revolutions 
is 6x* ■ 

(c) Opposite corners : As a concerete example, we revolve about the opposite corners a 
and g\ see Figure 9. 10c. A 120° revolution yields the permutation ( a)(bde)(chf)(g ) 
with cycle structure representation x\x\. One way to see what this permutation 
does is by noting that the three corners ft, d, e adjacent to a must be cyclically 
permuted in any motion that leaves a fixed (and similarly for the corners adjacent 
to g). A 240° revolution has the same structure. There are four pairs of opposite 
corners and so the contribution of opposite-corner revolutions is 8;c 2 x\. 

We leave it to the reader (see Exercise 17) to verify that we have enumerated all 
symmetries of the cube and that these symmetries are all distinct. 

Collecting terms, we find Pq = ^(x 8 + 6x| + 9x 4 + 8x[x|). The pattern inven- 
tory for corner colorings of the cube using black and white is thus 

/- [(ft + w) 8 + 6(ft 4 + w 4 ) 2 + 9 (ft 2 + w 2 ) 4 + 8(ft + w) 2 (ft 3 + w 3 ) 2 ] 

As in previous examples, the coefficients of the terms ft 8 , ft 7 w, ftw 7 , and vv 8 are 
readily seen to be 1. The b 6 w 2 , ft 5 w 3 , and ft 4 w 4 terms in the four factors of the 

generating function are ( h 28ft 6 w 2 + 56ft s w 3 + 70ft 4 w 4 H ), 6( h2ft 4 w 4 + 

• • •), 9( 1- 4ft 6 w 2 + 6ft 4 w 4 H ), and 8(- • • + ft 6 w 2 + 2ft V + 4ft 4 w 4 + • • •), re- 

spectively. Summing and dividing by 24 we have ^(- — h 72ft 6 w 2 + 72ft s w 3 + 
168ft 4 w 4 + •••) = 3 ft 6 w 2 + 3 ft 5 w 3 + 7ft 4 w 4 . (It is easy to detect errors in these cal- 
culations, since most errors will result in a noninteger coefficient.) By symmetry, we 
fill out the pattern inventory to obtain 


ft 8 + b 7 w + 3ft 6 w 2 + 3ft 5 w 3 + 7ft 4 w 4 + 3ft 3 w 5 + 3 ft 2 w 6 + ftw 7 + w 8 ■ 
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9.4 EXERCISES 


' Summary of Exercises The first 16 exercises use Polya’s enu- 
meration formula, and the remaining problems involve associated theory. Note that 
“floating” means that all possible rotations and reflections are allowed. 

1. Find the pattern inventory for black and white corner colorings of a floating 
pentagon. 

2. Find an expression for the pattern inventory for black-white, n-bead necklaces 
(rotations only) and find the number of necklaces with three white beads and the 
rest black: 

(a) n = 6 (b) n = 9 (c) n = 10 (d)«=ll 

3. Find the pattern inventory for black, white, and red corner colorings of a floating 
square. 

4. Find an expression for the pattern inventory for the 2-colorings (rotations only) 
of the 16 squares in a 4 x 4 chessboard. 

5. Find an expression for the pattern inventory for black-white corner colorings of 
the floating figures in Exercise 5 of Section 9.3. 

6. (a) Find the pattern inventory for corner 2-colorings of a floating pyramid (with 

a square base). 

(b) Repeat part (a) for edge colorings. 

(c) Repeat part (a) for face colorings. 

7. Find an expression for the pattern inventory for edge 2-colorings of the floating 
figures in Exercise 5 in Section 9.3. 

8. Find an expression for the pattern inventory for edge 2-colorings of a floating cube 
and find the number of edge 2-colorings with three white and nine black edges. 

9. Find an expression for the pattern inventory for face 2-colorings of 

(a) A floating tetrahedron (b) A floating cube 

10. (a) Find the pattern inventory for the corner 3-colorings of a floating pentagon 

with adjacent corners different colors. 

(b) Repeat part (a) for a floating tetrahedron. 

(c) Repeat part (a) for a floating cube. 

11. Find an expression for the pattern inventory for black-white colorings of four 
indistinguishable balls (see Exercise 10 in Section 9.3). 

12. Give an empirical argument (without use of the cycle index) to show that there 
are |_ § J different n-bead necklaces with 2 white beads and n — 2 black beads ( |_r J 
is the largest integer < r). 

13. Given that p is a prime and p-bead necklaces are made of black and white beads, 

(a) Show that each rotation except 0° is a cyclic permutation of the comers 

(b) What is the number of such necklaces with exactly k white beads? 
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14. (a) Suppose that instead of coloring the comers of a floating square, we attach 

0 or 1 or 2 identical jelly beans at a corner. Find a generating function, and 
expand it, for the number of different squares with a total of k beans at its 
corners. (Hint: In the inventory of patterns left fixed, the exponents within a 
factor will vary.) 

(b) Repeat part (a) with each corner getting a red jelly bean or a white jelly bean 
or a red and a white jelly bean. Now find a generating function for the number 
of squares with j whites and k reds. 

15. Suppose n batons with small holes drilled through their midpoints are strung 
along a piece of wire and each end of each baton is painted one of three possible 
colors (red, white, blue). The wire is fixed, but the batons revolve about their 
centers. How many indistinguishable configurations are there for 


(a) n—2 


(b) n = 3 


16. How many distinct (nonisomorphic) unlabeled graphs are there with four ver- 
tices? (Hint: Each possible edge has two possible colors: “edge present” and 
“edge absent.”) 

17. Show that the 24 symmetries of a cube listed in Example 4 are all distinct and 
include all symmetries of a cube. 

18. Characterize the geometric figures that when floating have only one coloring with 
one black corner and the others white. 

19. Show that there are C(n + m — 1 , n) different terms in the pattern inventory for 
;n-colorings of corners of an n-gon. 

20. Let A*,,, denote the minimum number of coefficients in the pattern inventory of 
all ^-colorings of the corners of an «-gon needed so that using symmetry all other 
coefficients in the pattern inventory are known. Find a generating function gk(x) 
for Gkji . 



9.5 SUMMARY AND REFERENCES 



Polya’s enumeration formula is important, practically because it solves important 
problems, mathematically because it is an elegant marriage of group theory and 
generating functions, and pedagogically because it is the only truly powerful combi- 
natorial formula we will see in this book (where a difficult problem can be solved by 
“plugging” the right numbers into a specialized formula). Of equal importance is the 
manner in which Polya’s formula has been developed: Sections 9.3 and 9.4 could be 
read as a case study in the experimental derivation of a mathematical theory. (This 
approach follows the pedagogical style Polya used when he taught this material.) Stu- 
dents interested in a more rigorous development of this theory and its extensions are 
referred to [2], which includes an English translation of Polya’s original 1937 paper, 
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“Kombinatorische Anzahlbestimmungen fur Gruppen, Graphen und chemische 
Verbindungen 

In a more formal development, one defines a coloring as a function / from a set 
S (of corners) to a set R (of colors). The function fo, corresponding to C 6 (in Figure 
9.1), would be written in tabular form as We define the composition jr • / 

of a motion and a color function to be a new color function /'; for example, jt 2 ■ f( 
maps a-^b-^w, c-^ b, c-^ d b, d -+ a ->■ w, or jt 2 • / 6 = {^bJb) = ft- 

The coloring permutation induced by a symmetry tt maps each / to it ■ f. Although 
tedious, a formal calculation of the coloring permutation is thus possible (before we 
did it by inspection). We define coloring equivalence by / ~ /' if and only if there 
exists 7 r such that n ■ f = f . Our theory can readily be restated in terms of this new 
definition. 

Polya’s enumeration formula has an important application to another field of 
combinatorial mathematics. It is used to enumerate families of graphs (see Exercise 
16 in Section 9.4). This application was pioneered by F. Harary; see Harary and 
Palmer [1]. 

1. F. Harary and E. Palmer, Graphical Enumeration, Academic Press, New York, 
1973. 

2. G. Polya and R. C. Reade, Combinatorial Enumeration of Groups, Graphs, and 
Chemical Compounds, Springer- Verlag, New York, 1987. 
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CHAPTER 10 

GAMES WITH GRAPHS 



PROGRESSIVELY FINITE GAMES 


In this chapter we develop the theory of progressively finite games and apply it to Nim- 
type games. While clearly not an important use of graphs, our graph-theoretic analysis 
of these games illustrates some very abstract strategies for attacking very practical 
problems that seem far removed from such abstraction. Further, it is essential to study 
how graph models can be used to solve a variety of real-world problems, but there 
is a more personal reward in learning how graphs can permit one to win at certain 
games. 

A game in which two players take turns making a move until one player wins 
(no ties are allowed) is called progressively finite if (1) there are a finite number of 
different positions in the game and (2) the play of the game must end after a finite 
number of moves. 

Our objective in this section is winning: how to determine winning strategies for 
progressively finite games. We can model a progressively finite game by a directed 
graph with a vertex for each position that can occur in the play of the game and a 
directed edge for each possible move from one position to another. 

Observe that the graph of a progressively finite game cannot contain any directed 
circuits, since players could move around and around a circuit of positions forever 
(violating the constraint on finite play). Thus, no positions can ever be repeated in the 
play of a game. Rather, the game moves inexorably toward some final position that is 
a win for one of the players. Games such as checkers and chess that permit ties and 
repetition of positions are not progressively finite. Most progressively finite games 
are “takeaway”-type games of the sort illustrated by the games in Examples 1 and 2. 


Example 1: Restricted Takeaway Game 

A set of 16 objects is placed on a table. Two players take turns removing 1, 2, 3, or 4 
objects. The winner is the player who removes the last object. The graph of this game 
is shown in Figure 10.1 (all edges are directed from left to right). ■ 


Example 2: Inverted Takeaway Game 

Starting with an empty pile, two players add 1 penny or 2 pennies or 1 nickel to the 
pile until the value of the pile is the square of a positive integer > 2 or until the value 
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16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


Figure 10.1 

exceeds 40. The player whose addition brings the value of the pile to one of these 
critical amounts takes all the money in the pile. The graph of this game is shown in 
Figure 10.2 (all edges are directed from left to right; arrows point to the game-winning 
positions). This game is an inverted form of a “takeaway” game in which 1, 2, or 5 
objects are withdrawn from a pile until the pile is reduced to certain critical sizes. ■ 

The reader is encouraged to try playing these games with a friend. A winning 
strategy for the first player (who makes the first move) in the game in Example 1 can 
be found with a little thought. The game in Example 2 is harder. The details of this 
second game only serve to confuse systematic attempts to find a winning strategy. 
It is easier to develop a general theory of winning strategies in progressively finite 
games and then apply the theory to Example 2. 

A winning position in a progressively finite game is a position at which play 
stops and the player who moved to this position is declared the winner. 

In the graph of a progressively finite game, vertices with 0 out-degree must repre- 
sent winning positions, for if no edges leave a vertex, the game must stop at that vertex. 
We call such vertices winning vertices. The graph of the game in Example 1 has just 
one winning vertex, numbered 0, while the graph of the game in Example 2 has six 
winning vertices, numbered 4, 9, 16, 25, 36, and “over 40.” Every progressively finite 
game must have at least one winning position or else play would go on without end. 

A winning strategy for a player is a rule that tells the player which move to make 
at each stage in a game so as to ensure that the player will eventually win, that is, finally 
move to a winning position. Obviously only one player can have a winning strategy. 

Any vertex adjacent to a winning vertex is a “losing” vertex in the sense that if 
one player moves to a “losing” vertex, the other player can then move to a winning 
vertex (and win the game). The vertices numbered 1, 2, 3, and 4 are losing vertices 
in the game in Example 1. The vertices numbered 2, 3, 7, 8, 11, 14, 15, 20, 23, 24, 
31, 34, 35, 37, 38, 39, and 40 are losing vertices in the game in Example 2. 

Stepping back one more move from the end of the game, we see that if all edges 
from a vertex x go to losing vertices, then x is a “prewinning” vertex. Whenever 
player A moves to such a prewinning vertex, then player B ’s next move must be to a 
losing vertex and now player A can move to a winning vertex. Vertex 5 in the game 
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in Example 1 is a prewinning vertex, since all edges from 5 go to losing vertices 
(1, 2, 3, and 4). Vertices 6 and 33 are the prewinning vertices in the game in Example 
2; all other vertices adjacent to losing vertices are also adjacent to nonlosing vertices. 

The general theory of winning strategies in progressively finite games is based 
on a recursive extension of the preceding reasoning. We seek good vertices, such 
as winning and prewinning vertices, that win or lead only to bad vertices. When our 
opponent is forced to move to a bad vertex, such as losing vertices, then we will always 
be able to move to another good vertex. A winning strategy for the first player will 
tell the player how to find good vertices to move to from any bad vertex. This pattern 
of play, with the first player moving to successive good vertices and the second player 
forced to move to bad vertices, will continue until finally the first player reaches a 
good vertex that is a winning vertex. If the game starts at a good vertex, and so the 
first player’s initial move must be to a bad vertex, then roles are reversed and it is the 
second player who now has a winning strategy using the good vertices. 

Example 1: (continued) 

The winning strategy for the first player in this game is to move to a vertex whose 
number is a multiple of 5. These are the good vertices. Thus, the move from the 
starting vertex 16 is to vertex 15 (i.e., the first player removes one object). From 15, 
the second player must move to one of the vertices 11, 12, 13, or 14. From any of these 
bad vertices the first player now moves to vertex 10. Whatever the second player’s 
next move is, the first player will always be able to move to the prewinning vertex 5, 
and one round later the first player will win. Note that if the game started with only 
15 objects, then the roles would be reversed and the second player would be able to 
use this winning strategy. ■ 

We formalize the concept of good vertices with the following definition. A kernel 
in a directed graph is a set of vertices such that both of the following are true: 

1. There is no edge joining any two vertices in the kernel. 

2. There is an edge from every nonkernel vertex to some kernel vertex. 

The kernel of the graph in Example 1 is the set of vertices numbered 0, 5, 10, 
and 15. 

Theorem 1 

If the graph of a progressively finite game has a kernel K, then a winning strategy for 
the first player is to move to a kernel vertex on every turn. However, if the starting 
vertex is in the kernel, then the second player can use this winning strategy. 

Proof 

First we show that all winning vertices must be in K. The reason is that vertices not 
in a kernel must have an edge directed to a vertex in the kernel. But winning vertices 
have 0 out-degree. Thus, all winning vertices must be in any kernel. 
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Next we show that moving to a kernel vertex on every turn is a winning strategy 
for the first player. By property (1) of a kernel, when the first player moves to a kernel 
vertex, the second player must then move to a nonkernel vertex. Then by property (2), 
the first player can always move from the nonkernel vertex to a kernel vertex. The play 
proceeds in this fashion with the first player always moving to a kernel vertex and the 
second player always moving to a nonkernel vertex. Since the game is progressively 
finite, the play must eventually end. Since all winning vertices are in the kernel, the 
first player must win. 

If the starting vertex is in the kernel, then the roles of the first and second players 
are reversed and the second player can always move to a kernel vertex for an eventual 
win. ♦ 


The preceding proof does not show explicitly how successively moving to kernel 
vertices leads to a winning vertex. The proof is existential. It shows only that by 
moving to kernel vertices the first player must eventually arrive at a winning vertex. 

A more immediate problem is to show that the graph of every progressively 
finite game has a kernel and then to find this kernel. Not all graphs have kernels. 
For example, the graph in Figure 10.3 has no kernel. To show that this graph has no 
kernel, we argue thus. If there were a kernel, we could assume by the symmetry in 
this graph that vertex a is in the kernel. Then vertex e, which has an edge to a, cannot 
be in the kernel. Vertex c/’s only outward edge goes to nonkernel vertex e, and so d 
must be in the kernel. Similarly, c cannot be in the kernel and then b must be in the 
kernel. But now the edge (a~ b) joins two kernel vertices — a contradiction. 

Fortunately, graphs of progressively finite games do have kernels. To demon- 
strate the existence of kernels in such graphs, we need to organize the vertices in a 
progressively finite graph into levels based on the “distance” of the vertices from a 
winning vertex. We recursively define the level /(x ) of vertex x in a directed graph 
and the sets of vertices at level < k as follows. Let s(x) = {y\x has an edge to y } 
be the set of successors of x. Then 

l(x) — 0 s(x) is empty and Lq = {x \ l(x) = 0} 

l(x) — 1 <£> x ^ Lq ands(x) c Lq and L\ — Lq U {x | /(x) = 1} 

and, in general, 

/(x) = k x £ Lk - 1 and^(x) C L*_i and L t t = L^-i U {x | /(x) = A:} 

Observe that L /. — L^~\ is the set of vertices at level k. 
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It can be shown that /(x) is the length of the longest path in the directed graph that 
starts at x (see Exercise 12). Since all paths in a progressively finite graph have finite 
length, the longest path from a vertex x has finite length. Further, the longest path 
from x must end at a vertex of out-degree 0; otherwise the path could be extended. 
Then starting from vertices of out-degree 0, the assignment of level numbers by this 
recursive definition will eventually reach all vertices. 

It follows from this definition of level that every vertex at level 0 is a winning 
vertex and that every vertex at level 1 is a losing vertex. A vertex at level 2 is also a 
losing vertex if it is adjacent to a vertex at level 0; it is a prewinning vertex only if all 
its successors are at level 1 . In general, every vertex at level k(k > 0) must be adjacent 
to a vertex at level k — 1 and possibly other vertices at lower levels, but cannot be 
adjacent to any other vertex at level k (or greater). Now we can prove the fundamental 
theorem for progressively finite games. 

Theorem 2 

Every progressively finite game has a unique winning strategy. That is, the graph of 
every progressively finite game has a unique kernel. 

Proof 

The proof is by induction on the levels or, more precisely, on the sets Lf ; . Let AT be 
the set of kernel vertices in L First consider the set Lq. Lq consists of vertices with 
0 out-degree. These are the winning vertices. As noted in the proof of Theorem 1, all 
winning vertices must be in the kernel. Thus Kq — Lq. 

Next let us inductively assume for n > 1 that K n _\ is the unique, well-defined 
set of kernel vertices in L„_i. We show that we can find a unique set of level-/! 
vertices that must be added to K „ \ to form the kernel K n for L n . By the way that 
level numbers were defined, /(x) = n means that ,v(x) C L„_ j. If a level-// vertex x is 
adjacent to no kernel vertex of A"„-i , this x must be in K„ (since by the definition of 
a kernel, any vertex with no successors in the kernel must itself be in the kernel). On 
the other hand, if x is adjacent to a kernel vertex of K„^\, x cannot be in the kernel. 
Hence K n = K„_\ U {x | l(x) — n and s(x) fl K n _ i = 0} is the unique, well-defined 
set of kernel vertices in L„ . It follows by mathematical induction that the graph has 
a unique kernel. By Theorem 1, this kernel is the unique winning strategy. ♦ 

The proof of Theorem 2 tells us how to build a kernel. First put the winning 
vertices in the kernel. Then recursively add the vertices at increasing levels not adjacent 
to the current set of kernel vertices. We implement this procedure for finding kernels 
using a labeling rule called a Grundy function. 


Definition of Grundy Function g(x) 

For each vertex x in a directed graph, g(x) is the smallest nonnegative integer 
not assigned to any of x’s successors. 
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We shall prove shortly that vertices with Grundy number 0 are exactly the set of 
kernel vertices. In the next section, Grundy numbers will be seen to play a fundamental 
role in more complex games. 

In the graph of a progressively finite game, Grundy values can be easily deter- 
mined using a level-by-level approach. The vertices on level 0 have no successors 
and so their Grundy number will be 0 (the smallest nonnegative integer). Next we 
determine g(x) for vertices x at level 1, then vertices at level 2, and so forth. In this 
way, all of a vertex x’s successors are assigned Grundy numbers before it is time to 
determine the Grundy number of x, since x’s successors are at lower levels. When 
we come to x, we can check the Grundy function values of s(x), x’s successors, and 
set g(x) equal to the smallest nonnegative integer not assigned to any vertex in s(x). 
Actually we do not need to proceed in a totally level-by-level fashion. We can use any 
method that does not try to assign a vertex its Grundy number until all the vertex’s 
successors have Grundy numbers. 

No Grundy function can be defined for the graph in Figure 10.3. Each vertex x in 
Figure 10.3 has a successor whose Grundy number must be defined before g(x) can be 
determined. Even if we try to invent simultaneously Grundy numbers for all vertices 
in Figure 10.3 at once, no Grundy function exists (details are left to the reader). 

The Grundy numbers for the vertices in Figures 10.1 and 10.2 are given in the 
following tables. Note that the vertices in Figure 10.1 that are in the kernel — namely, 
0, 5, 10, 15 — all have Grundy number 0. This is no accident. 

Table of Grundy Numbers for Figure 10.1 

Vertex x 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

g(x) 1 0 4 3 2 1 04321043210 


Table of Grundy Numbers for Figure 10.2 


Vertex x 0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 11 

12 

13 14 

15 

16 

g00 0 

3 

1 

2 

0 

1 

0 

2 

1 

0 

3 1 

0 

3 1 

2 

0 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 28 

29 

30 



1 

0 

2 

1 

0 

3 

2 

1 

0 

2 

0 1 

2 

0 



31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

over 40 





1 

2 

0 

1 

2 

0 

1 

3 

2 

1 

0 






Theorem 3 

The graph of a progressively finite game has a unique Grundy function. Further, the 
vertices with Grundy number 0 are the vertices in the kernel. 

Proof 

The recursive level-by-level construction of a Grundy function for the graph of a 
progressively finite game gives each vertex a unique Grundy number, as described 
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above. By the definition of a Grundy function, a vertex x with Grundy number 0 
cannot have a successor with Grundy number 0 (or else x would have to have a 
different number). Similarly any vertex y with Grundy number g(y) — k > 0 must 
have an edge to some vertex with Grundy number 0 (or else v’s number would be 0). 
Thus, the set of vertices with Grundy number 0 satisfies the two defining properties 
of a kernel. ♦ 

Example 2: (continued) 

By Theorem 3 and the foregoing table of Grundy numbers for Figure 10.2, we see 
that the kernel is the set 0, 4, 6, 9, 12, 16, 18, 21, 25, 27, 30, 33, 36, “over 40.” Since 
the starting vertex is in the kernel, the second player has the winning strategy in this 
game. A play of the game might proceed as follows (let player A be the first player 
and B be the second player): first A moves to 1 , then B moves to kernel vertex 6, then 
A must move to 1 1 (a move to 7 or 8 lets B win at 9), then B moves to kernel vertex 
12, then A must move to 17, then B moves to kernel vertex 18, then A moves to 20, 
and then B moves to the winning vertex 25 (and collects the 25 cents). ■ 



10.1 EXERCISES 


Summary of Exercises The first 10 exercises involve finding ker- 
nels and Grundy functions in various graphs of progressively finite games. Exercises 
1 1-17 involve proofs of properties of kernels, Grundy functions, and level numbers. 
Exercises 18 and 19 present two more complicated progressively finite games. 


1. Find a kernel in the following graphs or show why none can exist: 



2. Suppose that there are 25 sticks and two players take turns removing up to five 
sticks. The winner is the player who removes the last stick. Which player has a 
winning strategy? Describe this strategy. 

3. Repeat Example 2 with 2, 3, or 7 cents added each time. Find the set of positions 
in the kernel. 

4. Show that in Example 2 the second player can always win by the second move. 

5. Suppose in Example 2 that the first player A knows he/she will lose and wants 
to minimize his/her loss. What is the smallest winning amount A can force B 


392 


Chapter 10 Games with Graphs 


(the second player) to accept (such that if B tried to make the game go longer, 
then A could get into the kernel and win)? 

6. (a) Suppose we have a pile of seven red sticks and 10 blue sticks. A player can 

remove any number of red sticks or any number of blue sticks or an equal 
number of red and blue sticks. The winner is the player who removes the last 
stick. Find the set of positions in the kernel. 

(b) Repeat part (a) with a limit of removing at most five sticks of one color (or 
both colors) on a move. 

7 . Repeat Example 1 — but now the player to remove the last object loses. Describe 
the winning strategy for this game. 

8. Find the Grundy function for graphs or games in 

(a) Exercise 1(a) (b) Exercise 1(b) (c) Exercise 3 

9. Show that there is no Grundy function for the graph in Figure 10.3. 

10 . Find a directed graph possessing a kernel but no Grundy function. 

11 . If W (S') is the set of vertices without an edge directed to any vertex in the set S, 
show that a set S is a kernel if and only if .S' = W (S'). 

12 . (a) Show that if l{x) — k for a vertex x in the progressively finite graph G, then 

k is the length of the longest path starting at x in G. 

(b) Show that if g(x) = k, there is a path of length k starting at x in G. 

13 . Show that for any vertex x in a progressively finite graph, g(x) < l(x). 

14 . Show that if a directed graph G and every subgraph of G (obtained by deleting 
various vertices) have kernels, then G has a Grundy function. 

15 . Show that both the level numbers and the Grundy function in a progressively 
finite graph G constitute proper colorings of G. 

16 . Show that no matter how the edges are directed in a bipartite graph, it will always 
have a Grundy function. 

17 . Show that the graph of a progressively finite game can have only a finite number 
of vertices. [Hint: Show that if there are an infinite number of vertices, then there 
must be an infinite path (infinite play).] 

18 . Consider the following graph game. Player A tries to make a path with a set of 
vertices from the left to the right side of this graph. Player B tries to make a path 
from top to bottom. The players take turns picking vertices until one player gets 
the desired path. 
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(a) Show that this is a progressively finite game. 

(b) Find a winning strategy for Player A. 

19. The game of kayles has a row of n equally spaced stones. Two players alternate 
turns of removing one or two consecutive stones (with no intervening spaces). 
The player to remove the last stone wins. 

(a) Draw the graph and find its Grundy function for a four-stone game. 

(b) Repeat part (a) for a six-stone game. 



10.2 NIM-TYPE GAMES 


In this section we extend the theory of progressively finite games to takeaway games 
involving several piles of objects. The simplest game of this form is called Nim, a 
game in which two players take turns removing any number they wish from one of the 
piles. The winner is the player who removes the last object from the last remaining 
(nonempty) pile. While the positions in the two games in Examples 1 and 2 in the 
previous section could be described with a single nonnegative integer representing 
the size or monetary value of the single pile, the position in a Nim game requires a 
vector of nonnegative integers {p\, P 2 , ■ ■ ■ , Pm), the Arth number representing the 
current size of the f th pile. 


Example 1: Game of Nim 

Consider the Nim game with four piles of sticks: one stick in the first pile, two sticks 
in the second pile, three in the third, and four in the fourth. See Figure 10.4. We 
represent the initial position of this game with the vector (1, 2, 3, 4). 

Let the first and second players be named A and B, respectively. A sample play 
of the game might go as follows. First A removes all four sticks from the fourth pile. 
The new position is (1, 2, 3, 0). Next B removes one stick from the third pile to 
produce position (1, 2, 2, 0). Now A removes the one stick in the first pile to produce 
position (0, 2, 2, 0). A has been playing a winning strategy — that is, moving into 
kernel positions — and is now about to win. If B removes all of the second or third 
pile, A will remove the other pile; or if B removes just one stick, A will remove one 
stick from the other pile and A will win on the next round. The reader is encouraged 
to play this Nim game with a friend. ■ 


Figure 10.4 I I I I 
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Figure 10.5 

The game in Example 1 has 2x3x4x5=120 different positions (the z'th pile 
has i + 1 possible sizes, 0,1,...,/). Thus, it would be very cumbersome to draw the 
graph of this game and compute its kernel with a Grundy function, as in the previous 
section. In general, a Nim game with m piles and n, objects in the ith pile will have 
(n | + 1)(«2 + 1) ■ • • (n m + 1) different positions. The only way that we could ever 
play winning Nim without a computer is if we could determine the Grundy number 
of a position (vertex) directly from the position vector (pi, p 2 , . . . , p m ). Fortunately, 
such direct computation is possible. 

First we need to examine the Grundy function for a single-pile Nim game. Con- 
sider the Nim game with one pile of four sticks. See Figure 10.5. The graph of this 
game has edges (zT j), for all 0<j<i< 4. The Grundy number of vertex 0, the 
winning vertex, is 0; the Grundy number of vertex 1 is 1 ; and so on. In any one-pile 
Nim game, vertex i will have a Grundy number of i, since vertex i has edges to all 
lower-numbered vertices. Of course, the strategy of any one-pile Nim game is trivial: 
the first player removes the whole pile and wins. The nice form of this single-pile 
Grundy function will simplify the computation of Grundy functions for multi-pile 
Nim games. 

Although the graph of a multi-pile Nim game is too complex to draw, 
we can still describe it symbolically. We have a vertex for each position 
(p i, pi. . . . , Pm), where 0 < pi < n, («,- is the initial size of the z'th pile). Since 
the only permissible moves are removing some amount from one pile, the as- 
sociated graph has edges from vertex (pi, p 2 , . ■ . , p m ) to vertex (q\ , <72 , . . . , q m ) 
for each pair of vertices such that for one j.qj, < Pj, and for all i ^ j. q, = 
p t . This graph is in some sense a “composition” of the graphs for each 
pile, for if we fix all p, except one, say pi, then the subgraph of vertices 
(pu Pi, 0, P4, . . . , p m ), (pu P 2 , 1, PA, ■■■, Pm), ■■■, ( Pi , Pi, Pa, ■ ■ ■ , Pm) is ex- 
actly the graph for pile 3 alone. 

This type of composition of graphs can be formalized as follows. The direct 
sum H = H\ + H 2 H — • + H m of graphs H\, H 2 , ... , H m with vertex sets X \ , 
X 2 , . . . , X m , respectively, has vertex set X = {(x\, X 2 , ■ ■ ■ , x m ) \ Xi e A), 1 < i < m} 
and edges defined by the successor sets 

s((xi, x 2 , . . . , x m )) = {(v, * 2 , x 3 , . . . , x m ) | y e .s'(x,)} 

U{(xi, y, x 3 , . . . , x m i) I y e x(x 2 )} 


U[(xi,x 2 , . ■ • y) | y e s(x m )} 

It follows from this definition that the graph G of an zzz-pile Nim game is the direct 
sum of the graphs G, of the z'th pile: G = G \ + G 2 ~i — • + G m . 
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Example 2: Graph of 2 X 2 Nim 

Consider the simple Nim game with two piles of two objects each. Figure 10.6 a shows 
the graph G i = G 2 of one pile alone. Figure 1 0.6/; shows the graph G — G\ + G 2 of 
the two-pile game. Find the Grundy function and kernel of this graph. 

The Grundy numbers for the vertices are shown in Figure 10.6k. The vertices 
with Grundy number 0 are in the kernel. Since g(( 2, 2)) = 0, the second player has 
the winning strategy, according to Theorems 1 and 3 in the previous section. ■ 


Next we show how to compute the Grundy number of a vertex (jti, X 2 , . . ■ , x m ) 
in a direct sum of graphs from the Grundy numbers g(x, ) of the vertices x, in the 
component graphs G The computation to be performed on these Grundy numbers 
is called a digital sum. 

The digital sum c of nonnegative integers c\, C 2 , ■ ■ ■ , c m , written 
c = Ci+ C 2 + C3 + • • • + c m 

is computed in the following manner. Let c (k> be the kth binary digit in a binary 
expansion of c, that is, c = c (0) + c (1 ^2 + c (2> 2 2 + ■ ■ ■ , and similarly let c k) be the kth 
digit in a binary expansion of c,-. Then c (<:) = c (k> + c {k) + • — h c® (modulo 2). That 
is, the kth binary digit of c is 1 if the sum of the kth binary digits of the c: ; s is odd, 
and the kth binary digit of c is 0 if the sum of the kth binary digits of the CjS is even. 

Example 3: Digital Sum 

Compute the digital sum 2+12+15 + 8. We write the numbers 2, 12, 15, and 8 
in binary form and determine the sum (modulo 2) of the digits in each column, as 
illustrated in Figure 10.7. Translating the value of the binary sum back into an integer, 
we see that this digital sum equals 9. ■ 

We now present a remarkable theorem. In essence, it says that digital sums are 
the only way to win at Nim. 

Theorem 

If the graphs G \ , G 2 , ■ ■ ■ , G n possess Grundy functions g(x), then the direct sum G = 
G\ + G 2 + ■ — b G m possesses a Grundy function g(x), where x = (xi, X 2 , . . . , x m ), 
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2 3 2 2 2 ' 2 ° 

2=00 1 0 
12 = 1 1 0 0 

15 = 1 1 1 1 

8 = 10 0 0 

Figure 10.7 1 0 0 1 = 9 


defined by 


g(x) = g((x 1 ,x 2 ,...,x m )) = gl(x I ) + g 2 (x 2 )+ +g m (x m ) 


Proof 

We must show that g(x) is the smallest nonnegative integer that is not equal to any 
number in the set (g(y) | y e .v(.r)}. This definition of a Grundy function can be broken 
into two parts: (1) if y e s(jc), then g(x) ^ g(y), and (2) for each nonnegative integer 
b < g(x), there exists some y e s(x) with g(y) = b. 

Part (1): Show that if y e s(x), then g(x) g(y). Since y = (yi, y 2 , . . . , y,„) e 
s((x i, x 2 , ... . x m )), then by the definition of direct sum, for some j, yj e s(xj), and 
forall; j,yt = jc,- . If Ck = gk{xk)wt\ddk — gk(vk), then Cj rZ/andc,- = dj. So 

g(x) = ci+c 2 + ■ ■ • + cj + ■■ ■ +c m = C' + Cj 

g(y) = d\+ d 2 + ■ ■ ■ + dj + ■ ■ • + d m = c 1 + dj 

where c’ is the digital sum of all cs except cj . It is not hard to show that 

c' + Cj = c' + dj if and only if Cj = dj (see Exercise 9). Since Cj di, we 

conclude that g(x) g{y), as required. 

Part (2): Show that for each nonnegative integer b < g(x), there exists some 
y e six) with g(y) = b. A general proof of this part is fairly technical (see Berge 
[2], p. 25, for details). The practical side of this proof, discussed below, is finding a 
y with g(y) = 0 (a kernel vertex) and moving to it. A formal proof of part (2) is a 
generalization of the discussion below. ♦ 

Corollary 

The vertex (p \ , p 2 , , p m ) in the graph of an 777 -pile Nim game has the Grundy 
number p\+ p 2 + ■ ■ ■ + p m ■ Thus (p\, p 2 , . . . , p m ) is a kernel vertex if and only if 
Pi + Pi + • • • + Pm = 0. 

The reader should go back to the Nim game of two piles of two objects each in 
Example 2 and check that the Grundy numbers obtained for the vertices of the graph 
of that game are the digital sums of the pile sizes. 
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1=00 1 

2 = 0 1 0 

3 = 0 1 1 

4=1 0 0 

Figure 10.8 1 0 0=4 

Example 1: (continued) 

The Grundy number of the starting vertex of the Nim game in Figure 10.4 is the 
digital sum 1+2 + 3 + 4 = 4, as computed in Figure 10.8. The first player A wants 

to decrease the size of one of the piles so that the new digital sum is 0 (a kernel 

position). That is, A should alter the binary digits in one row of Figure 10.8 so that 
the sum (bottom) row is 0 0 0. For the sum row to become all 0s, every column now 
having an odd number of 1 s should have one of its digits changed (either a 1 to a 0 or 
a 0 to a 1) to make the number of Is in that column even. 

Since the sum row has a 1 in just the 2 2 column, we can change the (single) 1 in 
that column to a 0. That is, pile 4’s binary expansion should be changed from 1 0 0 to 
0 0 0, and so player A should remove all sticks in the fourth pile. This new position 
(1, 2, 3, 0) has a Grundy number of 0. Note that to make the number of Is even in 
the 2 2 column, we could not change any 0 to a 1, for the new binary expansion in the 
altered row would then be a larger number — an impossible move in Nim. ■ 


We now generalize the method of finding a vertex with Grundy number 0 given 
in the preceding example. Form a digital sum table as in Figure 10.8 for the current 
game position. Pick a row e having a 1 in the leftmost column that has an odd number 
of Is — that is, row e should have a 1 in the leftmost column with a 1 in the sum row. In 
row e, change the digit in every column having a 1 in the sum row. After this change 
of digits, every column will have an even number of Is, and so the sum row will be 
all 0s. Thus, this new position will be a kernel vertex. Note that since the leftmost 
digit that is changed in row e is a 1 (this is how row e was chosen), changing digits 
in row e will yield a smaller number, call it h. The first player should thus decrease 
the size of the eth pile to a size of h objects. 

Example 4: Another Game of Nim 

Consider the Nim game of four piles with two, three, four, and six sticks shown in 
Figure 10.9a. The digital sum table for the initial position is shown in Figure 10.9b. 
The 2 1 column is the leftmost column with a 1 in the sum row, and so we must change a 
row with a 1 in the 2 1 column. We can use the first, second, or fourth row. Suppose that 
we choose the first row. Then since the 2 1 and 2° columns in the sum row have Is, we 
change the digits in these columns in the first row. The new first row is 0 0 1 . Thus, the 
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2 2 2 ' 2 ° 


1 1 

2=0 

1 

0 

1 1 1 

3=0 

1 

1 

till 

4 = 1 

0 

0 

mum 

6 = 1 

! 

0 


0 1 1=3 

Figure 10.9 M (P) 

first player should reduce the size of the first pile to 1 . The reader should continue the 
play of this Nim game with a friend (or against oneself) to practice this kernel-finding 
rule. ■ 



10.2 EXERCISES 


1. Find the Grundy number of the initial position and make the first move in a 
winning strategy for the following Nim games: 


I I 

(a) HI 
I I I 
I I I I I 


I 

(b) III 
I I I I I 
I I I I I I I 


(c) 


I I 

I I I I 
I I I I 


I I 


I I 

(d) III 
I I I I 
I I I I I 


2. Suppose that no more than two sticks can be removed at a time from any pile. 
Repeat the games in Exercise 1 with this additional condition. 

3. Suppose that no more than i sticks can be removed at a time from the ;th pile 
(piles are numbered from top to bottom). Repeat Exercise 1 with this additional 
condition. 


4. Suppose that only one or two or five sticks can be removed from each pile. Repeat 
the games in Exercise 1 with this constraint. 

5. Suppose that only one or four sticks can be removed from each pile. Repeat the 
games in Exercise 1 with this constraint. 

6. For the Nim game in Exercise 1(c), find moves that yield positions with Grundy 
number equal to 


(a) 1 


(c) 3 


(b) 2 
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7 . Suppose three copies of the game in Example 2 of Section 11.1 are played 
simultaneously. Players stop adding money to a pile when the value of the pile 
is a square or exceeds 40. The player who adds the last amount to the last pile 
wins the money in all three piles. Note that there is a table of Grundy numbers 
for this game near the end of the previous section. 

(a) If initially there are 2<f in two piles and 1^ in the third pile, what is the Grundy 
number of this position and what is a correct winning move? 

(b) Find all kernel positions in which the sum of the money in the first two piles 
is less than I ()<t and the third pile is empty. 

(c) Using the table of Grundy numbers for the one-pile game near the end of 
Section 11.1, write a computer program to compute the next winning move 
in this three-pile game. 

8. Draw the direct sums involving the graphs shown below. 

(a) Gi + G, (b) G1 + G2 + G3 



9 . Prove the assertion in the proof of the theorem that c' + c ; = c' + dj if and only 
ifc i = dj. 

10 . Generalize the argument for finding kernel vertices in Nim (preceding Example 
4) to obtain part (2) of the proof of the theorem. 

11 . Consider the variation of Nim in which the last player to move loses. Show that 
the winning strategy is to play the regular last-player-wins Nim strategy but when 
only one pile has more than one stick now decrease that pile’s size to 1 instead 
of 0 (or to 0 instead of 1). 

12 . Write a computer program to play winning Nim. 

13 . Consider the following variation on Nim. Place C(n + 1,2) identical balls in a 
triangle, like bowling pins. Two players take turns removing any number of balls 
in a set that all lie on a straight line. The player to remove the last ball wins. 


o 

o o 
000 
0000 


n = 4 


(a) Find a winning first move for this game when n — 3. 

(b) Repeat for n = 4. 
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y 10.3 SUMMARY AND REFERENCES 

The first published analysis of Nim by C. Bouton [3] appeared in 1902. The Grundy 
function for progressively finite games was presented by P. Grundy [5] in 1939. It was 
independently discovered a few years earlier by the German mathematician Sprague. 

This chapter has only scratched the surface of the theory of finitely progressive 
games. Interested readers should turn to On Winning Ways by Berlekamp, Conway, 
and Guy [1]. The reader should also see the classic book about mathematics and 
games, Mathematical Recreations and Essays by Coxeter and Ball [4]. 

1. E. Berlekamp, J. Conway, and R. Guy, On Winning Ways, 2 volumes. Academic 
Press, New York, 1982. 

2 . C. Berge, The Theory of Graphs, Methuen-Wiley, New York, 1962. 

3. C. L. Bouton, “Nim, a game with a complete mathematical theory,” Ann. Math. 3 
(1902), 35-39. 

4. H. Coxeter and W. Ball, Mathematical Recreations and Essays, University of 
Toronto Press, 1972. 

5 . P. M. Grundy, “Mathematics and games,” Eureka, January, 1939. 



A Prelude opened this textbook with the puzzle Mastermind that introduced com- 
binatorial reasoning in a recreational setting. This Postlude looks at a cryptanalysis 
problem that is again of a recreational nature but also illustrates the less structured side 
of combinatorial reasoning as it often occurs in real-world problems. In particular, 
we will look at a simple cryptographic scheme in which the analysis of underlying 
combinatorial problems is complicated by the somewhat random pattern of letters in 
English text. 


P.l Letter Frequencies 

There have been many tables produced of the relative frequencies of letters in 
English writing, starting with Samuel Morse (of Morse code fame). We use fre- 
quencies averaged over several tables. 

Most Common Letters in English Text 
Vowels Consonants 

E 12% T 9% 

A, I, O 8% N, R, S 6-8% 

D, L 4% 

The least frequent letters, all consonants, are: 

J, Q, X, Z below .05% 

As we shall see, it is also useful to know which pairs of consecutive letters, called 
digraphs, are most frequent. The eight most common digraphs are 

Most frequent: TH 

Second most frequent: HE 

Next six most frequent: ER, RE and AN, EN, IN, ON 

N is unique among the very frequent letters in that close to 90% of its occurrences 
are preceded by a vowel; other frequent letters have a much wider range of other letters 
preceding them. Some other frequent digraphs that can be helpful are 

ES, SE ED, DE ST TE, TI, TO OF 
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Frequent consonants tend to appear beside vowels but vowels do not occur side 
by side often and similarly frequent consonants do not appear side by side often except 
for TH and ST. So there is a quasi-bipartite graph-like relationship with vowels as one 
set of vertices and frequent consonants as the other set of vertices in the bipartition), 
and the frequent digraphs are the edges. 

There is one triple of consecutive letters, called a trigraph, that stands out, 
namely, THE. THE is four times as common as any other trigraph in English text. It 
is frequent both as a three-letter word and as a part of other words. 

If we are given an encoded message, we could count the frequency of each letter 
in the message to determine single-letter frequencies. To get information of how often 
various letters occur before and after other letters, we build what is called a trigraphic 
frequency table. For each occurrence of a letter, we record the letter just preceding 
and the letter following this letter. To illustrate, consider the following cryptogram. 
Here spaces have been removed between words, but for readability letters are written 
in groups of five. 


Cryptogram 
FJYHP KKYRH 
JYRYD TQYCO 
OYRFI JSWYP 
YQFQI JPEYO 
RFJQI NYSVI 
OPKQR PIEQX 


YKYRF HYVYK 
JPMIY FJQIN 
TFYRK QIAYJ 
FJSFF JYQRO 
BVSOM SFQRB 
XSSCC PDYOQ 


PRQYI SFIFP 
YSVTP VFJYT 
QWYOQ RSVPD 
PPVIY FFQRB 
HCSII FJYVY 
RIQOY 


RNAVP PUDQC 
QVIFF QKYQR 
ONRPQ INTSI 
DQCCA VQRBP 
DQCCA YRPOY 


CAYJY COQRF 
FJYES IFIFM 
JQPRP MFIQO 
MFFJY AYIFQ 
CQWYV QYIPT 


The trigraphic frequency table for the cryptogram is given in Table P. 1 . Letter 
frequencies appear at the top of each column in Table P. 1 . When a trigraph is repeated 
in some letter’s column, the trigraph is underlined. To illustrate how the table is 
constructed, consider the beginning of the cryptogram: FJYHP KKY . . . For each 
occurrence of a letter, we enter the letter just before it and the letter just after it. For 
the first letter F, we enter .J in F’s column (the indicates that since F is the first 
letter of the message, no letter precedes it). For the second letter J in the message, 
we enter FY in J’s column. For the third letter Y, we enter JH in Y’s column. For the 
fourth letter H, we enter YP in H’s column; for the fifth letter P, we enter HK in P’s 
column. For the sixth letter K, we enter PK in K’s column. Since the seventh letter is 
also K, next in K’s column, we enter KY. 

For the frequent letters, their columns of trigraph data can be overwhelming, and 
so it is helpful to make a digraph table for each frequent letter, such as F, listing the 
frequency of letters that occur two or more times Bef ore F and After F. See Table P.2. 

We shall be referring to the data in Tables PI and P.2 repeatedly through this 
Postlude. 

Finally, we list the sequences of three or more letters that are repeated several 
times in the message. 


seven times: FJY three times: DQCCA, RFJ, QRB (also DQCCAY two times) 
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Table P.l Trigraph table for cryptogram 


6 

4 

12 

6 

3 

27 

0 

4 

22 

16 

7 

0 

5 

5 

13 

21 

31 

20 

13 

6 

1 

12 

3 

2 

37 

A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 

N 

0 

P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

NV 

RD 

QC 

UQ 

YS 

. J 


YP 

YS 

FY 

PK 


PI 

RA 

CQ 

HK 

RY 

YH 

IF 

DQ 

PD 

YY 

SY 

QX 

JH 

CY 

RP 

CA 

YT 

PY 

RH 


RY 

FF 

YY 

KY 


FO 

IY 

CJ 

KR 

DC 

YF 

YV 

VP 


AP 

QY 

XS 

KR 

IY 

IV 

YO 

PO 

IQ 

SI 


FY 

MY 

FY 

YY 


PF 

OR 

MY 

FR 

OR 

PQ 

El 

YQ 


ST 

QY 


HK 

CV 

RH 

YO 

BQ 


IP 


BC 

QN 

OP 

YP 


PF 

IT 

YQ 

VP 

TY 

PN 

JW 

PF 


PF 



KR 

YY 

HS 

QC 

YQ 


RJ 



VF 

FQ 

QY 


OS 

IY 

DN 

PU 

JI 

QF 

RV 

NS 


Ql 



HV 

CY 


CA 

PY 


YJ 



SF 

FY 

RQ 




QY 

JM 

QN 

YY 

TI 

PO 


SP 



VK 



HS 



VJ 



FF 

FY 

PQ 




YF 

TV 

TV 

QF 

JF 



PI 



Ql 



QC 



IF 



FJ 

IS 





RP 

YT 

FK 

YF 

YV 



AQ 



AJ 



CA 



FQ 



QA 

YQ 





SM 

VD 

YR 

YK 

VO 



SI 



JC 



YQ 



RJ 



QN 

IQ 





PY 

RQ 

KI 

QS 

MF 



BS 



JR 



SC 



II 



SJ 

IP 





TP 

QR 

JW 

NP 

Cl 



YY 



RD 



CP 



IM 



FQ 

FS 





YQ 

RM 

OR 

PP 

XS 



YQ 



QC 






RI 



QJ 

FY 





QY 

JE 

PI 

QO 

SC 






IF 






TY 



VF 

FY 






OP 

JP 

QB 







NS 






MI 



YF 

FQ 






PV 

10 

QB 







JT 






QQ 



QN 

FY 






BM 

YF 

QF 







KQ 






OJ 



VB 







RO 

FI 

QB 







JE 






SF 



SI 







IT 

YR 

YP 







OR 






FJ 



IF 







OK 

FR 

QP 







WP 






YF 



YP 







RI 

DC 

Ql 







FR 






FQ 



PE 







CD 

VR 








AJ 






MF 



RQ 








FR 








WO 






FJ 











JI 








OQ 






IQ 











FR 








EO 






RJ 











DC 








JQ 






SQ 











CW 








IF 






IJ 











VY 








JA 

















KR 








AI 

















EX 








NS 

















OR 








JV 

















IQ 








VD 


AR 

OC 

wv 

24 

DO 

0 . 


Note that longer repeats, such as DQCC A and DQCCAY, can be found by looking 
at trigraph (3-letter) repeats and concatenating these repeats together. That is, DQCCA 
is built from the repeated trigraphs of DQC, QCC, and CCA. Observe that this 5-letter 
sequence is probably a word, since the chances are extremely low that a repeated 
5 -letter sequence would be formed by a common ending of three different words 
followed by a common start of three other different words. 

Now we start the decoding process. We typically begin with the letters in the 
English word THE. The very frequent trigraph FJY is a perfect fit to be the encoding 
of THE, since (i) THE is the most frequent trigraph in English text and FJY is the most 
frequent trigraph in the cryptogram (occurring 7 times), (ii) E is the most frequent 
letter in English text and Y is the most frequent letter in our cryptogram, (iii) TH 
is the most frequent digraph in English text and FJ is the most frequent digraph in 
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Table P.2 Digraph repetitions for frequent letters in table P.l 


12 

27 

21 

16 

13 

22 

31 

20 

14 

12 

36 


C 

F 

I 

J 

O 

P 

Q 

R 

S 

V 

Y 


Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Bef 

Aft 

C4 

F4 

F4 

F10 

C2 

J2 

D3 

P3 

J2 

A2 

A4 

3C 

Q3 

16 

Q5 

13 

Q2 

G2 

F4 

Q10 

S2 

P2 

12 

2D 

Y3 

M2 

S3 

Y2 

Y3 

P2 

13 

Y6 

Y2 

S3 

J8 

2F 


R5 

V3 



R4 

J4 



Y3 

K3 

31 

Aft 

S3 

Y3 

Aft 

Aft 

V2 

K2 

Aft 

Aft 


N2 

2 J 

3A 

Y2 


2P 

2P 


02 

3B 

3F 

Aft 

04 

2K 

4C 


Aft 

4Q 

2Q 

Aft 

T2 

4F 

31 

31 

Q3 

30 

20 

Aft 

6F 

3S 

4Y 

2D 

V2 

4P 

3V 

2P 

V2 

3Q 


4F 

3 J 

8Y 


2M 

Y3 



2Q 

W3 

6R 


41 

3N 



2P 




2Y 


2S 


10J 

2Q 



2R 

Aft 





3V 


5Q 

2Y 



2T 

3C 







2V 41 
20 
10R 
3Y 
2W 


the cryptogram (occurring 10 times), and (iv) T is one of the most frequent letters in 
English text and F is one of the most frequent letters in our cryptogram. We write the 
information about the encoding of THE as 


Tp = Fc, Hp = Jc, Ep = Yc, 


where the P subscript stands for Plain text and the C subscript stands for Code text. 

Once we know that Ep = Yc , we can look for frequent letters that are rarely beside 
Y, keeping in mind that vowels do not occur side-by-side often. Looking through the 
digraph information in Figure P.2, we see that Yc has no repeated occurrences before 
or after Pc (Yc appears before Pc once and not at all after Pc). So Pc is extremely 
likely to be a vowel. To find another vowel, we can look at frequent code letters that 
have few occurrences of Yc and Pc before and after them. An excellent candidate 
vowel is Sc which has no Pc’s before or after it, no Yc’s after it, but two Yc’s before 
it. No other frequent letter has only two Yc’s beside it (before or after). 

Finally, recall that in English text the letter N is distinctive because it occurs 
almost exclusively with the frequent vowels (A, E, I, O) before it. One code letter 
has exactly this characteristic — namely. Re- Nineteen of the 20 occurrences of Rc 
are preceded by Pc (three times), Qc (10 times), and Yc (six times). We already 
identified Pc and Yc as vowels. So Np = Rc, and Qc is another vowel. While Qc has 
Yc(= E P ) occurring three times before and three times after it, Qc has no Pc before or 
after it and has no Sc before or after it; recall that Sc was identified as a likely vowel 
in the preceding paragraph. Thus, Yc(= Ep), Pc, Qc and Sc are extremely likely to 
be the four frequent vowels Ap, Ep, Ip, and Op, although we do not know which 
vowel corresponds to each of these code letters other than that Yc = Ep. Among the 
four most frequent English consonants Np, Rp, Sp, and Tp, we have identified that 
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T P = Fc and N P = Re- In sum, we have made a very good start at breaking this 
cryptogram by only considering aggregated data about the frequency of single letters, 
digraphs, and one trigraph. 


P.2 Keyword Transpose Encoding 

In this Postlude we shall consider the following scheme for encoding text called 
keyword transpose encoding. Interested readers can learn more about the basics 
of cryptography and some of the associated mathematics from the references at the 
end of this Postlude or from web sources. We use a given keyword, suppose it is 
MORNING, to create a one-to-one mapping of plain-text letter to code letters. The 
first step is to build an array of letters in which the first row consists of the letters in 
MORNING, with repeated letters omitted (that is, we drop the second N), and the 
remaining rows are constructed by listing the rest of the 26 letters (those not in the 
keyword) in alphabetical order row-by-row, as follows: 

M O R N I G 

A B C D E F 

H J K L P Q 

S T U V W X 

Y Z 

The number of columns in the array, which we denote by n co i, is the number 
of distinct letters in the keyword. MORNING has six distinct letters in it, and so 
n co i — 6 for this array. The number of rows in the array, denoted n mw , equals |"26/n co /] , 
where [] is used to indicate that we round the quotient up to the next whole number. 
With the keyword MORNING, n mw = \26/n co {\ = (26/6) = 5. 

Now we create the encoding sequence by listing code letters in the array taken 
column-by-column. We place this sequence of code letters underneath the plain-text 
alphabet: 

ABCDEFGHIJKLMNOPQRSTUVWXYZ 

MAHSYOBJTZRCKUNDLVIEPWGFQX 

That is, the plain-text letters A P , B P , C P , D P , E P are encoded with the code letters 
that appear going down the first column of the array: Me, Ac, He, Sc, Ye- This 
encoding is called a transpose scheme because we build the array by listing the code 
letters in some sort of row-by-row scheme and we then transpose the array by taking 
the code letters column-by-column to get the encoding sequence. This process gives 
a fairly good scrambling of plain-text letters into code letters, and can be generated 
from the memory of a simple keyword. 

Now we consider how we can use the structure of a keyword transpose scheme 
to help us decipher the message. The first step is to determine n co i, the number of 
columns in the array or equivalently the length of the keyword (not counting repeated 
letters). Let us consider how the plain-text letters are assigned to locations in the 
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keyword transpose array for keywords of length 6, length 7, and length 8. We do not 
know the code letters in the array, so we leave the code positions blank. However, 
we do know where the plain-text letters go. In each of these arrays, we insert the 
encoding of the letters in the English word THE that we determined in the previous 
section; namely, Tp = Fc, Hp = Jc, Ep = Yc, 


Table P.3: Possible Keyword Transpose Arrays 
Table P.3a: Array for n co i — 6 


Plain text: 

A 

F 

K 

0 

S 

w 



Code text: 









Plain text: 

B 

G 

L 

P 

T 

X 



Code text : 





F 




Plain text: 

C 

H 

M 

Q 

U 

Y 



Code text : 


J 







Plain text: 

D 

I 

N 

R 

V 

Z 



Code text: 









Plain text: 

E 

J 







Code text : 

Y 

- 







Table P.3b: Array for n co/ — 

7 






Plain text: 

A 

E 

I 

M 

Q 

U 

X 


Code text : 


Y 







Plain text: 

B 

F 

J 

N 

R 

V 

Y 


Code text: 









Plain text: 

C 

G 

K 

0 

S 

W 

Z 


Code text : 









Plain text: 

D 

H 

L 

P 

T 




Code text : 

— 

J 

— 

— 

F 




Table P.3c: Array for n co/ 


8 






Plain text: 

A 

E 

I 

L 

0 

R 

U 

X 

Code text : 


Y 







Plain text: 

B 

F 

J 

M 

P 

S 

V 

Y 

Code text : 









Plain text: 

D 

G 

K 

N 

Q 

T 

W 

Z 

Code text : 






F 



Plain text: 

A 

H 








Code text : J 
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Remember that the code letters in the first row of the array spell the keyword 
(with repeated letters omitted) and then the succeeding rows list the non-keyword 
code letters in alphabetical order. 

Consider the encoding Tp = Fc- Since F is about a quarter of the way through 
the alphabet, if code letter Fc is not in the keyword (appearing in the first row of 
the array), then Fc will appear about a quarter of the way through the alphabetical 
listing of the other code letters in the remaining rows of the array. We look at where 
Fc is placed in the three different encoding arrays given above when we make the 
assignment Tp = Fc. For n co \ — 6, Tp is located towards the end of the second row — 
i.e., corresponding to a code letter not in the keyword that is about a quarter of the 
way through the alphabet. This is exactly where Fc should appear if Fc is not in 
the keyword. In the second scheme where n co i = 7, T P is located at the very end of 
the array and would be encoded as the last letter not in the keyword. Thus Tp = Fc 
is impossible for this array. The situation is no better when n co i = 8, for again Tp 
is located near the end of the listing of the letters not in the keyword. We conclude, 
based on the encoding Tp = Fc, that only the keyword transpose encoding array with 
n co i — 6 is possible among these three arrays. 

Next we perform the same analysis for the encoding of H P and of E P . Jc (=Hp) 
is about 40% of the way through the alphabet, and, since J is a very infrequent letter, 
Jc is unlikely to appear in a keyword. Then the encoding Hp = Jc requires that Hp 
appear in a position that corresponds to a code letter that is 40% of the way through the 
listing of non-keyword code letters. For n co i = 6, Hp is well placed (see the 6-column 
array). For n co i = 7 and n <:ol = 8, H P has a location that corresponds to a letter near 
the end of the alphabet — impossible. So again, only for n co i = 6, does Hp = Jc fit the 
encoding array. 

Finally we look at the encoding Ep = Yc- Y is a relatively infrequent letter that 
is the next-to-last letter in the alphabet. The code letters Yc and Zc are both unlikely 
to be in the keyword, so the code letter Yc will normally appear as the next to the last 
letter in the non-keyword code letters. For n co i = 6, Ep is positioned at the next-to- 
letter non-keyword code letter. Exactly the right place for Ep = Yc. For n co i = 7 and 
n co i — 8, Ep is assigned to the second letter in the keyword. While there are words 
in which Y is the second letter, such as CYPHER, the odds of Y not being in the 
keyword are vastly greater than Y being the second letter in the keyword. 

When we look at how the encodings of each of T P = Fc, H P = Jc, and E P = 
Yc fit the three different arrays, we see that the array with n co i — 6 is the only one 
that makes sense. We note that if we had looked at other arrays with greater or fewer 
columns than the above three arrays, we would have found that Tp or Hp would 
have to correspond to a code letter not in the keyword that was near the end of the 
alphabet — impossible given T P = Fc and H P = Jc- So we can be very confident that 
n co i = 6. 

As an aside, note that if Yc is the most frequent code letter in a cryptogram, we 
can be quite certain that Ep = Yc and n co i = 6. Also if a 7-letter keyword is used, 
then Tp = Zc (see Table P.3b); furthermore, this is the only array where Zc can be the 
encoding of a very frequent English letter. Thus, if Zc is a very frequent code letter 
in a cryptogram, then Tp = Zc and n co i = 7. 
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The underlying logical argument here was used many times in the graph theory 
part of this text, where it was called the AC Principle: Assumptions generate helpful 
Consequences. An example of the AC Principle is that if a graph is planar, then there 
are a number of properties that such a graph must have, such as e < 3v - 6. If one 
of these properties does not hold, then the graph cannot be planar. Here, we tried 
assuming different lengths of the keyword, and then checked for each length whether 
the known encodings for the plain-text letters T, H, and E were consistent with the 
required position of their code letters in the different arrays. When the encodings did 
not fit an array, we could eliminate the associated keyword lengths. 

Let us continue with our efforts to determine the encoding. In the previous section, 
we used trigraphic analysis to show that Np = Rc and that Pc, Qc , and Sc were very 
likely to be the three other frequent vowels A P , I P , and 0 P . We can now conclude 
that J P = Zc since J P must be encoded as the last of the non-keyword letters (since 
we know E P = Yc). We add this information to the 6-letter keyword transpose array, 
obtaining Table P.3a(i). 


Table P. 3a (i) 


Plain text: A F 

Code text: {P,Q,S} 


K O S W 

{P,Q,S} 


Plain text : B 
Code text : 

Plain text: C 
Code text : 


G L P T X 

F 

H M Q U Y 

J 


Plain text: D 
Code text : 

Plain text: E 
Code text : Y 


I NR 
{P,Q,S} R 

J 

Z 


V z 


Determining the size of the keyword transpose array places tremendous con- 
straints on encoding possibilities. While it can be hard to determine the encoding 
for plain-text letters that appear in the keyword row of the array (its first row), the 
encoding of several frequent plain-text letters in other rows of the array is usually 
possible. For example, consider the plain-text vowel I P , which is the other frequent 
vowel (besides E P ) that does not appear in the first row. Ignore for a moment that I P 
is a vowel and concentrate solely on its position in the array. I P appears just before 
N P (= Rc) in the fourth row of our encoding array, so I P corresponds to a code letter 
that appears before, or almost before, Rc in the list of non-keyword code letters. The 
letter just before R in the alphabet is Q and since Q is an extremely infrequent letter 
and unlikely to be in the keyword, it is virtually certain that Qc is the code letter just 
before Rc, in the listing of non-keyword code letters — that is, Ip = Qc- We confirm 
this by noting that Qc is known to be a frequent vowel and hence is one of our three 
possibilities for the encoding of I P . 
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Next look at R P , the other frequent consonant (besides T P , and N P ) that does not 
appear in the first row. R P appears just after N P = Rc in the fourth row. The letter 
following R in the alphabet is S, but Sc is known to be a vowel. The next two code 
letters Tc and Uc have too low a frequency (see Table Rl) to be R P . Next is Vc, the 
only remaining (in alphabetical order) high-frequency letter except for Yc, which is 
known to encode E P . So based on frequency alone, Vc is the only possible encoding 
of R P . Moreover, Vc has repeated occurrences of Yc (= E P ) both before and after it, 
matching the fact that ER and RE are both frequent English digraphs. This solidifies 
the assignment R P = Vc- 

Now note that there are two (undetermined) code-letter positions at the end of 
the fourth row following Vc (= Rp); also we know that the fifth row starts with Yc 
(= E P ). Since there are just two letters, W and X, in the alphabet between V and Y, 
we conclude that the two code letters at the end of the fourth row are W followed by 
Z; that is V P = Wc and Z P = Xf. So we now have determined five of the six code 
letters in the fourth row 

Fourth row of array: 

Plain text: D I N R V Z 

Code text: _ Q R V W X 

The other two frequent code letters that are known to be vowels, Pc and Sc, must 
correspond to A P and O p , which are in the keyword. Also the fact that Tc and Uc are 
missing in the fourth row means that these code letters are also in the keyword. 

Low-frequency letters can also be useful in finding entries in this array. For 
example, Q p is a very infrequent English letter and appears in the third row of our 
array. Q P will be encoded by an infrequent code letter that occurs soon after Jc, 
the encoding of H P , the second code letter in the third row. Looking in Table P. 1 at 
the frequencies of letters after Jc, Lc stands out with zero frequency; all other code 
letters near Lc are at least moderately frequent. Thus it is extremely likely that Q P 
= Lc- Since there is only one letter between Jc and Lc in the alphabet, the position 
in-between Jc and Lc in the third row must be occupied by Kf. That is, M P = Kc- 

We now rewrite the array in Table P.3a(ii) with the code words we have determined 
so far. 


Table P. 3a (ii) 


Plain text : 

A 

F 

K 

0 

S 

w 




Code text : 

{P,S} 



{P.S} 



(T and 

u 

also known 

Plain text : 

B 

G 

L 

p 

T 

X 

to 

be 

in keyword) 

Code text : 





F 





Plain text : 

C 

H 

M 

Q 

U 

Y 




Code text : 


J 

K 

L 






Plain text : 

D 

I 

N 

R 

V 

Z 




Code text : 


Q 

R 

V 

W 

X 





Plain text : E J 

Code text : Y Z 
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We have determined the encodings of many plain-text letters. As shown in 
Table P.3a(ii), we know four of the six letters in the keyword, and so there are only 
two additional code letters that belong in the keyword row. Let us identify where 
in the remaining rows these two remaining code letters in the keyword must occur. 
There are three sequences of missing code letters: 

(i) There are four unfilled code letter positions in the second row preceding Fc, and 
there are five letters of the alphabet preceding F. Thus one of the code letters Ac, 
Be, Cc, Dc, Ec must be in the keyword. 

(ii) There are two unfilled code letter positions between Fc and Jc, and there are 
three letters of the alphabet between F and J. Thus one of the code letters Gc, 
He, I must be in the keyword. 

(iii) There are three unfilled code positions between Lc and Qc, and there are four 
letters of the alphabet between L and Q. However, we know that one of these 
letters, Pc, must be in the keyword. Then the other three letters must occupy the 
three unfilled positions. That is. 


Up = Me, Yp = Nc, and Dp = Oc, 

Let us turn now to the two plain-text vowels in the first row of the array, Ap and 
Op, that we know correspond collectively to the code letters. Pc and Sc- One of the 
ways to try to identify the encoding of Ap is to look in the message for common short 
words that consist an A in combination with other letters that are already decoded. 
Such a word is THAT, since the T P and H P are known. Another consecutive pair of 
common words is HAVE BEEN (currently we know the encoding of all these letters 
except Ap and Bp). In the fourth row of our message starting with the eleventh letter 
we find a possible (THAT)p, namely the sequence (FJSF)c- Moreover, these four 
code letters are followed by (FJY)c, which we know is the encoding of (THE)p. The 
odds of the phrase “THAT THE” in our plain-text message — i.e. Ap = Sc — are vastly 
greater than that of a word ending in “THIT” is followed by “THE” — i.e., A P = Pc- 
So we conclude with high confidence that Ap = Sc- It then follows that the other 
plain-text vowel in the keyword row Op must be encoded as Pc- As an aside, we note 
there is no pair of consecutive Yc’s (= Ep) in this cryptogram, and hence no point in 
trying to find an encoding of HAVE BEEN. 

We have found the encodings of the four frequent plain-text vowels and all the 
most frequent plain-text consonants except Sp. There is just one code letter occurring 
more than 12 times that we have not identified — namely, Ic, which occurs 21 times. It 
is natural to look for digraph information to confirm that Sp = Ic - The list of frequent 
English digraphs near the beginning of this Postlude contained SE, ES, and ST. And 
indeed in the digraph information for Ic we find that Yc (= E P ) both precedes and 
follows Ic several times and that Fc (= T P ) follows Ic six times. Based on the high 
frequency of Ic and these digraphs with Ic, it is extremely likely that I c = Sp. Observe 
also that for the two missing non-keyword code letters between Fcand Jc in the array, 
there are now only two choices of code letters: Gc and He, since Ic (= Sp) is in the 
keyword row. Thus Xp = Gc and Cp = He- 
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The only moderately frequent plain-text letter we have not identified now is L P , 
and the only unidentified code letter with moderate frequency is Cc, which occurs 
12 times. The location in the array is good for the match Cc = L P , since this is the 
assignment that would result if Ac, Be, or Cc were all non-keyword letters, so that 
Cc would be the third code letter in the second row of the array. Further, no other 
unidentified code letter occurs more than six times. However, there are no frequent 
English digraphs with L to look for. Based on the frequency of Cc and its position in 
the array, the match Cc = L P looks good. 

There is one additional piece of information to use. Recall that we found the 
repeated sequence (DQCCAY)c in our message. If Cc = L P , then we are forced by 
the missing letters in the beginning of the second row of the array to assign Ac = B P 
and Be = C P . Then we decode 

(DQCCAY)c as (JLLBE) P 


To help us, we note that (FJ Y V Y)c = (THERE) P immediately precedes one of the 
occurrences of (DQCCAY)c yielding the decoded sequence of letters THERE _ILLBE. 
We recognize the phrase THERE WILL BE, confirming that Cc = L P . We have 
additionally determined that Dc = W P . Back in the second row of the array, the final 
missing code letter can be assigned (now that we know Dc is in the keyword). Thus 
Ec = P p . 

We write the array with all the encodings we have identified: 


Table P. 3 (iii) 







Plain text : 

A 

F 

K 

0 

S 

w 

Code text : 

S 



P 

I 

D 

Plain text : 

B 

G 

L 

P 

T 

X 

Code text : 

A 

B 

C 

E 

F 

G 

Plain text : 

C 

H 

M 

Q 

U 

Y 

Code text : 

H 

J 

K 

L 

M 

N 

Plain text : 

D 

I 

N 

R 

V 

Z 

Code text : 

0 

Q 

R 

V 

W 

X 

Plain text : 

E 

J 





Code text : 

Y 

z 






(T and U are also 


in keyword) 


We note that we have been able to decode all but two of the code letters without 
determining the keyword. We accomplished this by using the structure of the encod- 
ing array, frequency data for code letters and digraphs, and the repeated sequence 
(DQCCAY)c- The only time we looked at the cryptogram to guess a particular word 
involved the encoding THAT when we knew the encoding of T and H. Finishing up 
the decoding, we now easily deduce that the keyword is STUPID. 

One does not always get a near-complete decoding of the code letters in a cryp- 
togram by applying the above methods, as occurred here. However, it is virtually 
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always possible to determine the encoding of THE and N. Then the number of columns 
in the keyword transpose array can be determined, and from the array structure, some 
other frequent code letters can be decoded. If further progress is difficult, one can 
substitute into the cryptogram the plain-text decodings of all the known code letters. 
From all the resulting word fragments, the plain-text equivalents for other code letters 
can be deduced and then confirmed by checking that these decodings fit properly into 
the keyword transpose array. Alternating between using the array structure and using 
word fragments formed by deciphered code letters, one can normally decode any 
message. In the very worst case, one might have to guess the plain-text equivalent 
for one or two frequent code letters and then apply these proposed decodings in the 
cryptogram to see if they produce words and word fragments that validate the guesses. 

Readers will hopefully recognize in the preceding reasoning the same sorts of 
step-by-step analysis, aided by insights unique to the particular problem at hand, that 
arose repeatedly in the graph theory and counting chapters of this text. The important 
difference here is that we face the additional challenge of randomness, a complication 
that characterizes many of the applications of combinatorial reasoning in real-world 
problems. Preparing readers to analyze the combinatorial situations they will face in 
future employment is the ultimate goal of this text. 

POSTLUDE EXERCISES: 

The following cryptograms are all encoded using the keyword transpose method 
discussed above. 

1. Decode this cryptogram: 

TDHQP HKFHZ PIYHF KNOYI ZHFAH EDLAI EXZTZ OEZQE XXAHT AXHZN 

YHFBN FOTXX ZWHPJ PZHOP YFNLF TOOED LTDVT YYXEK TZEND PYFNL 

FTOOE DLBNF TXXNZ WHFKN OYIZH FPEZE PHPZE OTZHV ZWTZE DZQNJ 

HTFPZ WHFHQ EXXND XJAHN DHZWN IPTDV CNAPQ NFXVQ EVHBN FKNOY 

IZHFY FNLFT OHFD 


2. Decode this cryptogram (note: the sequence JPENFYV occurs five times): 


FKYVY 

STDDB 

YQYLP 

UHFVG 

FTPUH 

GFFKY 

JPLEN 

FYVJY 

UFYVU 

YIFSY 

YAFPE 

VPFYH 

FFKYV 

YHFVT 

JFTPU 

HPUYU 

VPDDL 

YUFHT 

UJPLE 

NFYVH 

JTYUJ 

YJPNV 

HYHTU 

FKYRG 

DDFKY 

JPLEN 

FYVJY 

UFYVS 

TDDBY 

JDPHY 

QQNVT 

UCFKY 

EVPFY 

HFHGU 

QJNW 

YUFJP 

LENFY 

VHJTY 

UJYHF 

NQYUF 

HFPDQ 

FPVNU 

FRY TV 

EVPCV 

GLHGF 

JPLEN 

FYVHG 

FPFKY 

VDPUC 

THDGU 

QHJKP 

PDH 









3. Decode this cryptogram: 







PCFIQ 

BZWOR 

ZEVOF 

ZRMFZ 

WMRKG 

CYWZW 

OQGIJ 

QEGRO 

SOGOF 

QZIOZ 

OTYOG 

MOFJO 

VMFRQ 

XHMFK 

JGIYZ 

QKGCP 

RSWOF 

YGQBO 

RRQGZ 

EJNOG 

CRRMK 

FOVZW 

MRWQP 

OSQGN 

YGQAX 

OPZWO 

GOSOG 

OLEMZ 

OCBOS 

RZEVO 

FZRSW 

QWCHO 

AOOFR 

ZEPYO 

VAIZW 

MRNOI 

SQGVZ 

GCFRY 

QROOF 

JQVMF 


KRJWO POKQQ VXEJN 
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4. Decode this cryptogram: 


RPQPO 

LTPUG 

KYDVP 

PEPEG 

KYBVH 

DKGKY 

PVQVY 

FOCGG 

KHGGK 

YTOMA 

YVPEY 

RBYFS 

THDCH 

THVBV 

HDKSF 

CYFFG 

KHTPV 

YNOHC 

GPGKV 

YYGSM 

YFGKY 

TOMAY 

VPEWY 

VGSJY 

FSTGK 

YBVHD 

KMSTO 

FFSIS 

EGKYB 

VHDKS 

FASDH 

VGSGY 

GKYTO 

MAYVP 

EYRBY 

FSFAP 

OTRYR 

AQGUP 

GSMYF 

GKYTO 

MAYVP 

EWYVG 

SJYFM 

STOFE 

POV 






5. Decode this cryptogram: 







GKXUX 

SLTIG 

FNINX 

TGGID 

ILXSA 

QIPYF 

TGGID 

XFUTK 

IYGKX 

RXJIR 

STCIA 

GKXJI 

PULXJ 

UQEGI 

CUFNL 

SLLPE 

EILXR 

GIBXR 

ITXQI 

PASUL 

GTXXR 

GIBPS 

DRGKX 

GUSCU 

FEKGF 

BDXFT 

RGKXT 

DIIMA 

IUGKX 

YIURG 

KXTXH 

GGUQG 

IASTR 

GKXVI 

YXDLB 

QDIIM 

STCAI 

UAUXO 

PXTGJ 

IRXDX 

GGXUL 

GKFGR 

ITIGF 

EEXFU 

AUXOP 

XTGDQ 

BXLSR 

XGKXJ 

IRXDX 

GGXUG 

KFGLG 

FTRLA 

IUGKX 

EDFST 

GXHGD 

XGGXU 

XTXHG 

GUQGI 

ASTRY 

KSJKJ 

IRXDX 

GGXUS 

LTCII 

RDPJM 







6. Decode this cryptogram: 







JDBXB 

CXBXY 

UWXIJ 

DCJTX 

WRBII 

WXYJQ 

EBXKS 

WYIB 

FBKQX 

GTJWM 

XCUIS 

FJDBM 

XCTDJ 

DBWXG 

JCEBD 

WUBRS 

FCVJD 

CJKSV 

VABNB 

XGDCX 

OJWOB 

QWOBD 

WKBNB 

XIJYO 

BKJIK 

DWEFW 

KDSUK 

BWAB 

VSBNB 

JDCJJ 

DBQXG 

TJWMX 

CUIJD 

SIGBC 

XKSW 

ABJDB 

I CUB I 

SWGU 

BIICM 

BIJDC 

JDBDC 

IYIBO 

SFTCI 

JGBCX 

I 






7. Decode this cryptogram: 







JUFYR 

QYPXI 

JNSIX 

XAMPE 

VNENV 

AHYEN 

QIVND 

ZFUEA 

REKNE 

ZPLNN 

YUXXU 

XJPWP 

XIJAN 

GNEUK 

NQPRZ 

PBZWN 

JPFFN 

DJNFN 

DZNMN 

EJIQN 

QZWIQ 

HNUEI 

DZWNY 

UQZQZ 

RVNDZ 

QWUGN 

ANNDG 

NEHAP 

IOZNE 

PRQUD 

VPBZN 

DIDZN 

EERYZ 

NUJPF 

FNDJN 

FNDZQ 

YNULN 

EQQYN 

JIUXN 

XNJZE 

PDIJV 

NGIJN 

QZPVN 

ZNJZW 

IVVND 

JUDQP 

BANNE 

SIXXA 

NRQNV 


8. Decode this cryptogram: 







FKXUX 

JITFS 

TOXFI 

BXUXE 

IUFLF 

KHFFK 

XOTSF 

XRLFH 

FXLHF 

FIUTX 

QCXTX 

UHDYS 

DDHTT 

IOTJX 

FKHFF 

KXMIL 

FYHTF 

XRJUS 

MSTHD 

LSTFK 

XOTSF 

XRLFH 

FXLHU 

XTIYH 

DDAIU 

MXUIA 

ASJXU 

LIAAS 

THTJS 

HDSTL 

FSFOF 

SITLF 

KXKOT 

FAIUF 

KXLXE 

XIEDX 

YSDDB 

XSTFX 

UTHFS 

ITHDL 

STJXM 

HTQIA 

FKXMK 

HVXXL 

JHEXR 

AUIMF 

KXJIO 

TFUQ 



9. Decode this cryptogram: 







ZWEUE 

BRFRZ 

RNFNB 

VLGVY 

WUEYE 

FUONB 

ZWEZE 

SZANN 

MJNCV 

GEGEV 

URFLZ 

WEGEV 

GEOEH 

EGVXU 

RBBEG 

EFZUE 

BRFRZ 

RNFOZ 

WEEFL 

RFEEG 

OCOEN 

FEOEZ 

NBZEG 

TOZWE 

TVZWE 

TVZRK 

RVFOC 

OEVFN 

ZWEGO 

EZVFU 

ZWEYW 

JORKR 

ozoco 

EVFNZ 

WEGOE 

ZZWEU 

EBRFR 

ZRNFJ 

NCOWN 

CXUCO 

EROZW 

ENFEZ 

WVZRO 

EVORE 

OZBNG 

JNCGV 

YYXRK 

VZRNF 
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10 . Decode this short cryptogram (warning: E is not the most frequent letter): 

MYTKI JIRUL AZOAH MIJAC UYGII JIUJA CHETR JMRUY MJFAG RMRPJ 
FTMEX ALAZU YMRQM OAZEX OAZRU ARTRI TGELG IJHAR JITJU AVYMO 
YVTLV IFMPY UXIOM XIUAP A 
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A.l SET THEORY 

A set is a collection of distinct objects. In contrast to a sequence of objects, a set is 
unordered. Usually we refer to the objects in a set as the elements, or members, of 
the set. These elements may themselves be sets, as in the set of all 5-card hands; each 
hand is a set of five cards. A family is a collection in which multiple appearances of 
objects is allowed; for example, {a, a, a, b. b, c). 

A set S is a subset of set T if every element of S is also an element of T. In the 
case of a 5-card hand, we are working with a 5-card subset of the set of all 52 cards. Any 
set is trivially a subset of itself. A proper subset is a nonempty subset with fewer ele- 
ments than the whole set. Two sets with no common members are called disjoint sets. 

This text uses capital letters to denote sets and lowercase letters to denote elements 
(unless the elements are themselves sets). The number of elements in a set S is denoted 
by N(S) or |S|. The symbol e represents set membership, for example, x e S means 
that x is an element of 5; and x S means that x is not a member of S. The symbol C 
represents subset containment — for example, T C S means that T is a subset of S. 
There are three ways to define a set with formal mathematical notation: 

1. By listing its elements, as in S = {x\, X 2 , * 3 , X 4 } or T = {{1, — 1}, {2,-2}, 
{3, —3), {4, —4}, {5, —5}, . . .} — implicitly, T is the set of all pairs of a positive 
integer and its negative 

2. By a defining property, as in P — { p \ p is a person taking this course} or R = 
{ r | there exist integers 5 and 1 with t ^ 0 and r — s/t } — R is the set of all rational 
numbers 

3. As the result of some operation(s) on other sets (see below) 

There are two special sets we often use: the empty, or null, set, written 0; and 
the universal set of all objects currently under consideration, written %, 

It is important to bear in mind that in many real-world problems, sets cannot be 
precisely defined or enumerated. For example, the set of all ways in which a large 
computer program can fail is ill defined. A census of the population of the United 
States involves substantial error for several important subcategories of the populace, 
yet the official United States population was given in 1990 as 248,709,873. Beware 
of any calculations based on imprecise sets ! 
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The three basic operations on sets that we will use are 

1. The intersection of S and T . S n T = [x e Tl | x e S and x eT] 

2. The union of S and T, SUT = {x e Tl | x e S or x e TJ 

3. The complement of S, S = {x e Tl x ^ S } 

A fourth operation that is sometimes useful is 

4. The difference of S of minus T, S — T = {x e Tl \x e S and x £ T] 

Observe that S — T can be expressed in terms of the preceding operations as S — T = 
SC\T. 


Example 1: Selecting Calculators 

A sample of eight different brands of calculators consists of four machines that have 
rechargeable batteries and four that do not. Also, four out of the eight have memory. 
We want to select four calculators to be taken apart for thorough analysis. Half of this 
set of four should be rechargeable and half should have memory. How many ways 
can such a set of four machines be chosen from the eight brands? 

To answer this question, we must know how many machines are rechargeable 
and have memory, how many are rechargeable and have no memory, and so on. 
That is, if R is the set of rechargeable machines and M the set of machines with 
memory, then we need information about the sizes of the intersections N(Rl T M), 
N(R fl M), N(R fl M), and N(R IT M). Assume that there are two machines in each 
of these four subsets. See Figure Al.l. Then one strategy to get the desired mix of 
four machines would be to pick one machine from each category in Figure Al.l 
(2x2x2x2=16 choices). There are two other strategies for getting the four ma- 
chines from Figure Al.l (see Exercise 4 at the end of this section). ■ 


The study of set expressions involving the foregoing set operations and 
associated laws is called Boolean algebra. Three of the most important laws of 
Boolean algebra are 

BA1. 5=1 
BA2. S n T = S U T 
BA3. SUT = S(lf 

To visualize set expressions, we use a picture called a Venn diagram. Figure 
A 1.2a shows a Venn diagram for the sets S and T. The whole rectangle represents 


No 

Memory memory 


Rechargeable 


2 2 


2 2 


Figure Al.l 


Not 

rechargeable 
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Figure A1.2 



the universe 4l of all elements under consideration. The circles represent the sets S 
and T. In Figure A 1 ,2b, the darkened region represents SDT and the striped area 
(SU 7)- (SflT). Note that in particular problems, certain regions in a Venn diagram 
may be empty sets. 

The next example illustrates how laws BA2 and BA3 can simplify a counting 
problem. 


Example 2: Counting with Boolean Algebra 

Consider the universe 41 of all 52 2 outcomes obtained by picking one card from a 
deck of 52 playing cards, replacing that card, and then again picking a second card 
(possibly the same card). Suppose that we want to compute the size of the set Q of 
outcomes with at least one spade or at least one heart. Let S be the set of outcomes 
with a void in spades — that is, in which no spade is chosen on either pick. Let H be 
the set of outcomes with a void in hearts. Write the set Q as an expression in terms 
of S and H, and calculate N(Q). 

The set Q equals S U II , the union of the set of outcomes with one or more spades 
(not a void in spades) and the set of outcomes with one or more hearts. The set S U H 
is the shaded area in Figure A1.3. By BA3, S U H = S fl H , where S fl H is the set 
of outcomes with no spades and no hearts (S fl H is the unshaded region in Figure 
A 1 .3). That is, S fl H is the set of outcomes where each pick is one of the 26 diamonds 
or clubs. Thus N(S n H) = 26 2 . By Figure A1.2, N(S~nH = V(4l) - N(S D H) = 
52 2 - 26 2 = 2704 - 676 = 2028, and so N(S U 77) = N(S(1H) = 2028. ■ 


Set theory, and, more generally, nonnumerical mathematics, were studied little 
until the nineteenth century. G. Peacock’s Treatise on Algebra, published in 1830, 
first suggested that the symbols for objects in algebra could represent nonnumeric 
entities. A. De Morgan discussed a similar generalization for algebraic operations 
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a few years later. G. Boole’s Investigation of the Laws of Thought (1854) extended 
and formalized Peacock’s and De Morgan’s work to present a formal algebra of 
sets and logic. The great philosopher-mathematician Bertrand Russell has written, 
“Pure Mathematics was discovered by Boole.” Subsequently it was found that nu- 
merical mathematics also needs to be defined in terms of set theory in order to 
have a proper mathematical foundation (see Halmos’s Naive Set Theory). See Chap- 
ter 26 of Boyer’s History of Mathematics for more details about the history of set 
theory. 


EXERCISES 



1. Let A be the set of all positive integers less than 30. Let B be the set of all positive 
integers that end in 7 or 2. Let C be the set of all multiples of 3. List the numbers 
in the following sets: 


(a) A n (B n C) (b) A D (B U C) (c) An(BUC) (d) A-(BO C) 


2. If A = {1,2,4, 7,8}, 5 = {1,5,7, 9}, and C = { 3,7, 8,9} and % = {1, 2, 3, 4, 
5, 6, 7, 8, 9, 10}, then find set expressions (if possible) equal to 

(a) {7} (c) {1} (e){4,8, 10} 

(b) {6, 10} (d) {2, 7, 9} (f) {3, 5, 6, 7, 9, 10} 


3. Suppose that in Example 1 we were given only the additional information that 
two calculators have memory but are not rechargeable. Show that we can now 
deduce how many machines are in each of the other three boxes in Figure ALL 

4. What are the other two strategies in Example 1 to pick a subset of four 
calculators, two of which are rechargeable and two of which have memory, 
given the numbers in Figure A1 . 1 ? How many choices are there for each of these 
ways? 

5. Suppose we are given a group of 20 people, 13 of whom are women. Suppose 
8 of the women are married. In each of the following cases, tell whether the 
additional piece of information is sufficient to determine the number of married 
men. If it is, give this number. 

(a) There are 12 people who are either married or male (or both). 

(b) There are 8 people who are unmarried. 

(c) There are 15 people who are female or unmarried. 

6. Label each region of the Venn diagram in Figure A 1.2a with the set expression 
it represents (e.g., the region where both sets intersect would be labeled A fl B). 

7. Draw Venn diagrams and shade the area representing the following sets: 

(a ) A- B (b) AUB (c) (ALLS) n (AfVB) (d) A-(B-A) 

8. Write as simple a set expression as you can for the shaded areas in the following 
Venn diagrams. 
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(a) ( b ) 




9. Create a Venn diagram for representing sets A, B, and C. The diagram should 
have regions for all possible intersections of A, B, and C. How many different 
regions are there? Label each region with the set expression it represents (e.g., 
the region where all three sets intersect would be labeled A fl B fl C). 

10 . Draw Venn diagrams for sets A, B , and C and shade the areas representing the 
sets in Exercise 1 . 

11 . Verify with Venn diagrams the following laws of Boolean algebra: 

(a) (A U B) n A = A (d) (AUB)flC = (AflC)U(BnC) 

(b) (A fl B) U A = A (e) (A IT B) U C = (A U C) IT {B U C) 

(c) A U A = At 

12 . Use Venn diagrams to determine which pairs of the following set expressions 
are equivalent. 

(a) (AnB)U (AH B) ( c ) (A — B) n (A U B) 

(b) A — (B — (A — B)) (d) (A fl (B — A)) 

13 . Use Law BA2 to prove that AnBflC = AUBUC [note that A D B fl C = 
(A n B) n c = A n (B n C)]. 

14 . Use Law BA3 to prove that AUBUC — AflBHC [note that A U B U C = 
(AU B)UC = AU(BUC)]. 

15 . Use Laws BA1, BA2, and BA3 plus the identities AflA = A,AUA = 
A, API B = B (T A, and AUB = BUAto simplify the following set expressions: 

(a) (AUB)lT(AnB) (c) (A U C)IT (A U B) IT(B U C) 

(b) A - (B U A) (d) (A U B) n [(C U B) - (A n B)] 

16 . Suppose two dice are rolled. How many outcomes have a 1 or a 2 showing on 
at least one of the dice? 

17 . Suppose that a card is drawn from a deck of 52 cards (and replaced before the 

next draw) three times. Let S be the set of outcomes where the first card is a 

spade, H be the set where the second card is a heart, and C be the set where the 

third card is a club. For each of the following sets E, write E as a set expression 
in terms of S, H, C , and determine N(E). 

(a) Let E be the set of outcomes contained in at least one of the sets S , H, C. 
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(b) Let E be the set of outcomes contained in at least one but not all three of the 


sets S, H, C. 

(c) Let E be the set of outcomes contained in exactly two of the sets S, H, C 



A.2 MATHEMATICAL INDUCTION 

The most useful, and simplest, proof technique in combinatorial mathematics and 
computer science is mathematical induction. Let p„ denote a statement involving n 
objects. Then an induction proof that p„ is true for all n > 0 requires two steps: 

1. Initial step: Verify that po is true. 

2. Induction step: Show that if po . p \ , pi. . . . , p n -\ are true, then p n must be true. 

Sometimes p n will be true only for n >k. Then the initial step is to verify that pt 
is true, and in the induction step we assume only that p^, Pk + i, ■ ■ ■ , p n -\ are true. 


Example 1: Summation Formula 

Let s„ denote the sum of the integers 1 through n — that is, s n = 1 + 2 + 3 H — • + n. 
Show that s„ = \n{n + 1). 

We use mathematical induction to verify this formula. Since .v n is not defined, 
the initial step is to verify the formula for \| . The formula says .v i = ^ 1(1 + 1) = 1 — 
obviously correct. For the induction step, we assume that the formula is true for 
s\, S 2 , ■ ■ ■ , s B -i- In this problem (as in most induction problems), we need only to 
assume that the formula is true for ,s„_ i . 

s»_ i = 1 + 2 + 3 + ■ ■ ■ + (n - 1) = I(ti - l)[(n - 1) + 1] = ±(n - 1 )n 

We now use this expression for .v„_. i to prove that the formula is true for s n . 

s„ = [1 + 24 f (n - 1)] + n = s n -\ +n 

= \ (n — 1 )n + n 
= \[{n - l)n + 2n] = \n(n + 1) 

This completes the induction proof that s„ = \n(n + 1) for all positive n. ■ 


Example 2: Population Growth Model 

Suppose that the population of a colony of ants doubles in each successive year. A 
colony is established with an initial population of 10 ants. How many ants will this 
colony have after n years? 

Let a n denote the number of ants in the colony after n years. We are given 
that at) = 10. Since the colony’s population doubles annually, then u\ — 20, a 2 = 40, 
and «3 = 80. Looking at the first few values of a n , we are led to conjecture that 
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a n — 2" x 10. For the initial step, we check that oq = 1 0 = 2° x 10, as required. For 
the induction step, we assume that a„_i = 2"“ 1 x 10. Now we use the annual doubling 
property of the colony: 

On = 2a„_i = 2(2 ,!_1 x 10) = 2” x 10 ■ 


Example 3: Prime Factorization 

A prime number is an integer p > 1 that is divisible by no other positive integer 
besides 1 and itself. Show that any integer n > 1 can be written as a product of prime 
numbers. 

We prove this fact by mathematical induction. Since the assertion concerns in- 
tegers n > 1 , the initial step is to verify that 2 can be written as a product of prime 
numbers. But 2 is itself a prime. Thus 2 is trivially the product of primes (i.e., actually 
of a single prime), itself. Next assume that the numbers 2, 3, . . . , n — 1 can be written 
as a product of primes, and use this assumption to prove that n can also be written as a 
product of primes. If n is itself a prime, then as in the case of 2, there is nothing more 
to do. Suppose n is not a prime, and so there is an integer m that divides n and for 
some integer k,n — km. Since k and m must be less than n. they can each be written 
as a product of primes. Multiplying these two prime products for k and m together 
yields the desired representation of n as a product of primes. ■ 


Although the Greeks used certain iterative arguments in geometric calculations 
and the principle of reductio ad absurdum, mathematical induction was first used 
explicitly by Maurolycus around 1550. Pascal used an induction argument in 1654 to 
verify the additive property of binomial coefficients in the array now known as Pascal’s 
triangle [this property is identity (3) in Section 5.5]. The actual term induction was 
coined by De Morgan 200 years later. See Bussey, “Origin of mathematical induction,” 
American Math. Monthly, 1917, for a fuller discussion of the history of mathematical 
induction. 



EXERCISES 


1. Prove by induction that 1 + 3 + • — h (2 n + 1) = (n + 1) . 


2. Prove by induction that l 2 + 2 2 


- n 2 = 


n(n + 1)(2 n +1) 


3. Prove by induction that — l 2 + 2“ — 3 2 H h (— 1 ) n n z = (— 1)" 


n(n + 1) 


4. Prove by induction that (1 x 2) + (2 x 3) H — • + n{n + 1) = 


n(n + l)(n + 2) 


5. Prove by induction that l 3 + 2 3 + • ■ 


- / 7 3 = 


n 2 (n+ l) 2 
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6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 


16. 

17. 

18. 

19. 

20 . 
21 . 
22 . 


Prove by induction that (1 + 2 + - — |- «) 2 = l 3 + 2 3 H — - + m 3 (assume the 
result in Example 1). 

11 In 

Prove by induction that 


Prove by induction that 


1x2 2x3 

1 1 


+ ■ 


n x (n + 1) n + 1 

1 1 


1x4 4x7 7x 10 


( 3 m — 2 ) x ( 3 « + 1 ) 


3 « + 1 


Prove by induction that (1 x 1 !) + (2 x 2!) + • — t- (n x m!) = (n + 1)! — 1. 

1 - a" +1 , 

Prove by induction that for a ^ 1 , = 1 + a + a + ■ — ha. 

1 — a 

- , , r . n(2n — 1)(2m + 1) 

Prove by induction that 1 - + 3 2 + 5“ H h (2m — l) 2 = . 

If the number a„ of calf births at Dr. Smith’s farm after n years obeys the law 
a n = 3a„_i — 2ci n -2 and in the first two years ai = 3 and a 2 = 7, then prove by 
induction that a n — 2" +1 — 1. 


Prove that any positive integer has a unique factorization into primes. You may 
assume the result of Example 3; you need to use the fact that if a prime p divides 
a product of positive integers, then p divides one of the integers. 

Write a computer program to find and print the prime factorizations of the first 
50 integers. 

Prove by induction that for any integer m > 0, m x n = n x m. By r x s, we 
mean the sum of r copies of .v. 

Prove by induction that for integers n >5,2" > n 2 . 

Prove that the number of different subsets (including the null set and full set) of 
a set of n objects is 2" . 

Prove by induction that A\ fl A 2 fl . . . fl A„ = A\ U A 2 U UA„ (see Exercise 13 
in Section A.l). 

Prove by induction that if n distinct dice are rolled, the number of outcomes 
where the sum of the faces is an even integer equals the number of outcomes 
with an odd sum. 


Prove by induction that the number of n-digit binary sequences with an even num- 
ber of 1 s equals the number of M-digit binary sequences with an odd number of 1 s. 
Prove by induction that the sum of the cubes of three successive positive integers 
is divisible by 9. 

The cat population in a dormitory has the property that the number of cats in 
one year is equal to the sum of the number of cats in the two previous years. If 
in the first year there was one cat and if in the second year there were two cats, 
then prove by induction that the number of cats in the nth year is equal to 


1 

7 ! 


~ + Vs 

2 2 


n+1 




n + 1 -1 
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23 . Why cannot one prove by induction that the number of binary sequences of all 
finite lengths is finite? 

24 . What is wrong with the following induction proof that all elements x\ , . . . , x n 
in a set of n elements are equal? 

(a) Initial step ( n — 1): The set has one element x\ equal to itself. 

(b) Induction step: Assume xi = X 2 = ■ ■ ■ — x n -\ . Since also x n -\=x n by 
induction assumption (when x„_i, x„ are considered alone as a two-element 
set), then x\ — X 2 = ■ ■ ■ = x n -\ = x n . 

25 . What is wrong with the following induction proof that for a ^ 0, a' 1 = 1? 

(a) Initial step (n = 0): a 0 — 1 — always true. 

(b) Induction step: Assume a" = 1 and now a" — — — = 1. 

✓ 


A.3 A LITTLE PROBABILITY 

Historically, counting problems have been closely associated with probability. Indeed, 
any problem of the form “how many hobbits are there who ...” has the closely related 
form “what fraction of all hobbits ...” which in turn can be posed probabilistically as 
“what is the probability that a randomly chosen hobbit . . . ?” The famous Pascal’s tri- 
angle for binomial coefficients was developed by Pascal around 1650 in the process of 
analyzing some gambling probabilities. The probability of getting at least seven heads 
on 10 flips of a fair coin, the probability of being dealt a five-card poker hand (from a 
well-shuffled deck) with a pair or better, and the probability of finding a faulty calcu- 
lator in a sample of 20 machines if 5 percent of the machines from which the sample 
was drawn are faulty — all these probabilities are essentially counting problems. 

Two hundred years ago, the French mathematician Laplace first defined proba- 
bility as follows: 

number of favorable cases 

Probability = 

total number of cases 

This definition corresponds to the “person in the street’s” intuitive idea of what prob- 
ability is. In this text, we treat probability problems only where Laplace’s definition 
of probability applies. Implicit in this definition is the assumption that each case is 
equally likely. If the total number of cases is infinite, for example, all real numbers 
between 0 and 1 , then we would not be able to use Laplace’s definition. A more subtle 
difficulty is that in some probability problems, “cases” have to be carefully defined 
or else they may not all be equally likely, as, for example, the possible numbers of 
heads observed when a coin is flipped 10 times. To clarify this difficulty, we need to 
introduce a little of the basic terminology of probability theory. 

An experiment is a clearly defined procedure that produces one of a given 
set of outcomes. These outcomes are called elementary events and the set of all 
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elementary events is called the sample space of the experiment. We are interested 
only in experiments where the elementary events are equally likely. An event that is 
a subset of several elementary events is called a compound event. 

For example, when a single die is rolled, then obtaining a specific number, such 
as 5, is an elementary event, whereas obtaining an even number is a compound event. 
If S is the sample space of the experiment and E is an event in .S', then Laplace’s 
definition of probability says that the probability of event E, prob(£), is 


problL) 


Vi/.) 

N(S) 


In most instances, the size of S is easily determined, and so the problem of deter- 
mining prob(£) reduces to counting the number of outcomes (elementary events) in 
event E. Returning to the die roll, the sample space of outcomes is S = {1,2, 3, 4, 5, 6}. 
If the event E is obtaining an even number, then E = {2, 4, 6}, and prob( E) = | = i. 

Many experiments we discuss involve a repeated (or simultaneous) series of sim- 
ple experiments. Each round of the simple experiment is called a trial. For example, 
the experiment of flipping a coin three times involves three successive trials of the 
simple experiment of flipping a coin. Rolling two dice and recording the sum of 
the two values rolled involves the two simultaneous simple experiments of rolling 
a single die. In any experiment involving multiple trials, the elementary events are 
the sequences of outcomes of the simple experiments. If the simple experiments are 
performed simultaneously, we number the simple experiments and list their outcomes 
in order of the experiments’ numbers. 

The sample space of elementary events for the experiment of flipping a coin three 
times is 


{IIIIH. HHT, HTH, HIT. 77///. TUT. TTH, TTT} (1) 

The sample space for the experiment of rolling two dice is 

5 = {O', j) 1 1 < i < 6, 1 < ./ < 6} 

Let us consider more closely the experiment of rolling two dice and recording 
the sum of the two values on each die (the sample space is the set S listed above). 

In terms of our formal definition of events, the sum of the two values equaling 
k is not an elementary event but rather a compound event. The event, the sum of the 
two dice equals 7, is the subset of elementary events 

5! ={(1,6), (2, 5), (3, 4), (4, 3), (5, 2), (6, 1)} 

Thus, prob(sum = 7) = N(Si)/N(S) = 6/36. Similarly, S 2 — {(L 1)} and so prob 
(sum = 2) = N(S 2 )/N(S) = 1/36. Observe that a sum of 7 is six times as likely 
as a sum of 2. If we had considered the possible sums of the two dice as the elemen- 
tary events, we would have had a sample space S* = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12} 
and each sum would have had probability 1 /N(S*) = 1/11 — clearly a mistake. The 
same type of error would have been made if we were to regard the number of heads 
as the elementary events in the experiment of flipping three coins [see the correct 
sample space listed above in (1)]. 
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Suppose we have a box containing five identical red balls and 20 identical black 
balls. Our experiment is to draw a ball. What is the sample space of elementary events? 
All we can record as an outcome is the color of the ball, leading to the sample space 
S = {red, black}. But then by Laplace’s definition of probability, picking a red ball 
will have probability | . Although this experiment consists of just a single trial, we 
are clearly making the same sort of mistake that occurs when the sums of dice are 
treated as elementary events. The reader is encouraged to pause a moment and try to 
supply his or her own explanation for resolving this red-black ball paradox before 
reading the next paragraph. 

The logical solution is to consider not what color ball we withdraw from the box 
but where in the box we direct our hand to grasp a ball. There are 25 different spatial 
positions where balls are located in the box. So, in this sense, there are 25 different 
events. We resolve this complication about “identical” objects with the following rule. 

Identical Objects Rule 

In probability problems, there are no collections of identical objects; all objects are 
distinguishable. 

In complex experiments one should always take a moment to be sure that the 
elementary events are properly identified. 

The original motivation for studying probability was the same as used in this 
book, games of chance — rolling dice, flipping coins, and card games. Cardano cal- 
culated the odds for different sums of two dice in the 1500s. Around 1600, Galileo 
calculated similar odds for three dice. A series of letters about gambling probabilities 
between Pascal and Fermat, written around 1650, constitute the real beginning of 
probability theory. Jacques Bernoulli’s Ars Conjectandi (1713) was the first system- 
atic treatment of probability (and associated combinatorial methods). One hundred 
years later, Laplace published his epic treatise Theorie Analytique des Probabilities, 
a work containing both the definition of probability in a finite sample space used in 
this book and also advanced calculus-based derivations of modern probability theory. 
See F. N. David, Games, Gods, and Gaming: A History of Probability (Dover, 1998) 
for an excellent history of probability theory. 


EXERCISES 



1. An integer between 5 and 1 2 inclusive is chosen at random. What is the probability 
that it is even? 

2 . In the experiment of flipping a coin three times, let E k be the compound event 
that the number of heads equals k. Determine prob( If ), for k = 0, 1, 2, 3. 

3 . In the experiment of rolling two distinct dice, find the probability of the following 
events: 

(a) Both dice show the same value. 
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(b) The sum of the dice is even. 

(c) The sum of the dice is the square of one of the die’s value. 

4 . A die is rolled three times. What is the probability of having a 5 appear at least 
two times? 

5 . Find the probability that in a randomly chosen arrangement of the letters h, a, t, 
the following occurs: 

(a) The letters are in alphabetical order. 

(b) The letter h occurs somewhere after the letter t in the arrangement. 

6. Find the probability that in a randomly chosen (unordered) subset of two numbers 
from the set 1, 2, 3, 4, 5 the following occurs: 

(a) The subset is {1, 2}. 

(b) 1 is not in the subset. 

7. An urn contains six red balls and three black balls. If a ball is chosen, then returned, 
and a second ball chosen, what is the probability that one of the following is true? 

(a) Both balls are black. 

(b) One ball is black and one is red. 

8. An urn contains two black balls and three red balls. If two different balls are 
successively removed, what is the probability that both balls are of the same 
color? 

9. Two boys and two girls are lined up randomly in a row. What is the probability 
that the girls and boys alternate? 

10 . Five distinct dice are rolled. What is the probability of getting at least one 6? 

11 . If a school has 100 students of whom 50 take French, 40 take Spanish, and 20 
take French and Spanish, then what is the probability that a randomly chosen 
student takes no language? 

12 . What is the probability that an integer between 1 and 50 inclusive is divisible by 
3 or 4? 

13 . There are three urns each with two balls: One urn has two black balls, the second 
has two red balls, and the third a black and a red ball. If an urn is randomly 
chosen and a randomly chosen ball in that urn is red, what is the probability that 
the other ball in this urn is also red? (Hint: Let the sample space be the set of all 
experimental outcomes where a red ball is chosen first.) 

14 . What should the sample space be for the following problem: An urn has 10 black 
balls and five red balls; four balls are removed but not seen (our eyes are shut) 
and then a fifth is removed and observed; now what is the probability that the 
fifth ball is red? 

15 . What should the elementary events be in each of the following experiments so 
that they are equally likely? 

(a) A fair coin is tossed until two heads occur or until the coin is flipped 10 times. 
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(b) A positive integer is chosen at random. 

(c) Two positive integers are chosen at random and their difference is recorded. 


(d) A ball is randomly chosen from an urn with two red and two black balls and 
this ball is replaced and a ball chosen again, if necessary, until a red ball is 
obtained. 



A.4 THE PIGEONHOLE PRINCIPLE 

The pigeonhole principle is one of the most simple-minded ideas imaginable, and yet 
its generalizations involve some of the most profound and difficult results in all of 
combinatorial theory. This topic is included in these appendices because it does not 
fit naturally into any chapter of this book. More mathematical texts on combinatorics 
devote a whole chapter to Ramsey theory — for example, see Brualdi’s Introductory 
Combinatorics, 4th ed. (Prentice Hall, 2004). For a thorough treatment of Ramsey 
theory, see Graham, Rothschild, and Spencer, Ramsey Theory (John Wiley, 1990). 


Pigeonhole Principle 

If there are more pigeons than pigeonholes, then some pigeonhole must contain two 
or more pigeons. More generally, if there are more than k times as many pigeons as 
pigeonholes, then some pigeonhole must contain at least k + 1 pigeons. 

This principle is also called the Dirichlet drawer principle. An application of it 
is the observation that two people in New York City must have the same number of 
hairs on their heads. New York City has over 7,300,000 people, and the average scalp 
contains 100,000 hairs. Indeed, the pigeonhole principle allows us to assert that it is 
theoretically possible to fill a subway car with about 73 New Yorkers, all of whom 
have the same number of hairs on their head. 

The following three problems suggest some of the diverse generalizations of the 
pigeonhole principle. 

1. How large a set of distinct numbers between 1 and 200 is needed to assure that 
two numbers in the set have a common divisor? 

2. How large a set of distinct numbers between 1 and n is needed to assure that the 
set contains a subset of five equally spaced numbers a\, 02 , < 73 . < 74 , < 7 . 5 — that is, 
#2 — d\ — a^ — a 2 = <74 — a^ — a$ — a\l 

3. Given a positive integer k, how large a group of people is needed to ensure that 
either there exists a subset of k people in the group who all know each other or 
there exists a subset of k people none of whom know each other? 

In Problem 3, it might be possible that for large values of k, there are groups 
of arbitrarily large size that do not meet the k mutual friends or k mutual strangers 
property. Such is not the case. One of the theorems of Ramsey theory is that there 
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exists a finite number r k such that any group with at least r k people must satisfy 
this property. The theorem does not say what number r k is. Actually, r k has been 
determined for only a few small values of k. 

Most generalizations of the pigeonhole principle require special, research-level 
skills rather than the combinatorial problem-solving logic and techniques that this 
text seeks to develop. However, we include the basic pigeonhole principle in this 
text because it can occasionally be very helpful. The following examples illustrate an 
obvious and two not-so-obvious uses of the principle. 

Example 1: Pooling Responses 

There are 20 small towns in a region of west Texas. We want to get three people from 
one of these towns to help us with a survey of their town. If we go to any particular 
town and advertise for helpers, we know from past experience that the chances of 
getting three respondents are poor. Instead, we advertise in a regional newspaper that 
reaches all 20 towns. How many responses to our ad do we need to assure that the set 
of respondents will contain three people from the same town? 

By the pigeonhole principle, we need more than 2 x 20 = 40 responses. ■ 

Example 2: Connecting Computers with Printers 

We have 15 minicomputers and 10 printers. Every five minutes, some subset of 
the computers requests printers. How many different connections between various 
computers and printers are necessary to guarantee that if 10 (or fewer) computers 
want a printer, there will always be connections to permit each of these computers to 
use a different printer? 

Using a variant of the pigeonhole principle, we see that there must be at least 60 
connections. Otherwise one of the 10 printers, call it printer A, would be connected 
to five (or fewer) computers and if none of the five computers connected to A were in 
the subset of 10 computers seeking printers, then printer A could not be used by any 
of these 10 computers. It happens that exactly 60 connections, if properly made, will 
solve the connection problem (see Exercise 15). ■ 

Example 3: Subsets Summing to 4 

Show that any collection of 8 positive integers whose sum is 20 has a subset summing 
to 4. 

We show that the collection must have one of the following four subsets with a 
sum of 4: (a) four Is; (b) two 2s; (c) two Is and a 2; (d) a 1 and a 3. That one of these 
possibilities must hold is a pigeonhole-type argument. 

If S contained no 1 or 2, then its sum would be at least 3x8 = 24. So by the 
pigeonhole principle, S must contain a 1 or a 2. 

Suppose S contains a 2. We are finished if there is a second 2 or if there are 
also two Is. Possibly there is one 1, but all other integers are at least 3. If there is 
no 1, the other 7 integers, each at least 3, must sum to 20 — 2 = 18 — impossible by 
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a pigeonhole principle-type argument. If there is one 1 . the other 6 integers, each at 
least 3, must sum to 20 — 2 — 1 = 17 — again impossible. 

Suppose S contains at least one 1 but no 2. If there is a 3 in S, case (d) applies. If 
there are four Is in S , case (a) applies. The alternative is no 2s and no 3s in S and at 
most three Is. Then there are at least five other integers in S of size at least 4 whose 
sum must be less than 20 — impossible. ■ 


EXERCISES 



1. Given a group of n women and their husbands, how many people must be chosen 
from this group of 2 n people to guarantee the set contains a married couple? 

2 . Show that at a party of 20 people, there are two people who have the same number 
of friends. 

3 . In a round-robin tournament, show that there must be two players with the same 
number of wins if no player loses all matches. 

4 . Given 10 French books, 20 Spanish books, 8 German books, 15 Russian books, 
and 25 Italian books, how many books must be chosen to guarantee that there 
are 12 books of the same language? 

5. If there are 48 different pairs of people who know each other at a party of 20 
people, then show that some person has four or fewer acquaintances. 

6 . A professor tells three jokes in her ethics course each year. How large a set of 
jokes does the professor need in order never to repeat the exact same triple of 
jokes over a period of 12 years? 

7 . Show that given any set of seven distinct integers, there must exist two integers 
in this set whose sum or difference is a multiple of 10. 

8. Show that if n + 1 distinct numbers are chosen from 1,2, ... ,2 n, then two of 
the numbers must always be consecutive integers. 

9 . Suppose the numbers 1 through 10 are randomly positioned around a circle. Show 
that the sum of some set of three consecutive numbers must be at least 17. 

10 . A computer is used for 99 hours over a period of 12 days, an integral number of 
hours each day. Show that on some pair of two consecutive days, the computer 
was used for at least 17 hours. 

11 . Show that any subset of eight distinct integers between 1 and 14 contains a pair 
of integers k, l such that k divides /. 

12 . Show that in any set of n integers, n > 3, there always exists a pair of integers 
whose difference is divisible by n — 1 . 

13 . Show that any subset of n + 1 distinct integers between 2 and 2 n in > 2) always 
contains a pair of integers with no common divisor. 

14 . Show that any set of 16 positive integers (not all distinct) summing to 30 has a 
subset summing to n, for n = 1,2,..., 29. 
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15. In Example 2, find the required set of 60 computer-printer connections. 

16. There used to be six computers and 10 printers in a large computing center. Each 
computer was connected to some subset of printers. Now the 10 old printers are 
being replaced by six more reliable printers, but temporarily the computers will 
still be allowed to believe that there are 10 printers. Four dummy printers will 
pass on computer requests to the six other real printers. How many connections 
between the four dummy printers and the six real printers are needed to handle 
any set of six printing requests? 

17. Show that for any set S of 10 distinct numbers between 1 and 60, there always 
exist two disjoint subsets of S (not necessarily using all the numbers in S) both 
of whose numbers have the same sum. 

18. Two circular disks each have 10 0s and 10 1s spaced equally around their edges 
in different orders. Show that the disks can always be superimposed on top of 
each other so that at least 10 positions have the same digit. 

19. A student will study basketweaving for at least an hour a day for seven weeks, 
but never more than 1 1 hours in any one week. Show that there is some period 
of successive days during which the student studies a total of exactly 20 hours. 

20. If G is an n- vertex graph in which each vertex has degree > in — I )/2, show that 
G is connected, i.e., there exists a path joining every pair of vertices in G. 

21. Show that any sequence of n 1 + 1 distinct numbers contains an increasing sub- 
sequence of n + 1 numbers or a decreasing subsequence of n + 1 numbers. 


22. Show that at any party with at least six people, there exists either a set of three 
mutual friends or a set of three mutual strangers. 



A.5 COMPUTATIONAL COMPLEXITY 
AND NP-COMPLETENESS 

This section gives a summary of the basic ideas of computational complexity. Not 
surprisingly, a major concern of computer science is the speed of a computation, 
whether it be a numerical algorithm to plot the trajectory of a spacecraft or an opera- 
tions research procedure to schedule airline crews or an information retrieval search 
in a database. For some computing problems, such as sorting (in alphabetical order) a 
list of n words, fast procedures are known. For other problems, such as the Traveling 
Salesperson Problem (determining the shortest Hamilton circuit in an n-vertex graph 
where each edge has a length), the calculation can take an incredibly long time for 
moderate sizes of n, e. g., n = 100. 

The standard way to describe the computational complexity, or speed, of a 
procedure is with a function f(n) whose value is the maximum number of computer 
steps (e.g., additions, multiplications, comparisons) required to execute the procedure 
when n is the “size” of the input data. The size could be the number of vertices in a 
graph or the number of items to be sorted. About the best possible function would be 
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a linear function. As the size of the input grows, the computational time would grow 
at a proportional rate. 

An even faster function is the logarithmic function. With the proper search tree, 
one can identify an unknown word in a dictionary of n words in log 2 « steps (compar- 
isons), as discussed in Section 3.1. The key is that one repeatedly compares whether 
the unknown word is before or after a sequence of test words, so that each comparison 
cuts the possible answers in half. The possible answers are leaves in a binary tree, 
and the internal vertices are the test words. The complexity of the search is the height 
of the tree. A balanced binary tree with n leaves has height log 2 «. 

A slightly slower function than linear would be of the form An\og 2 n, for some 
constant A. This was the number of comparisons of the Heap Sort algorithm for 
sorting n items, presented in Section 3.4. There are n ! possible orderings of a list of n 
items. To choose among n ! possible answers in only AnXog^n comparisons requires an 
algorithm, like the log 2 n recognition algorithm for an unknown word, that repeatedly 
splits the set of possible orderings in half. Note that log 2 (n!) is bounded by n log 2 n. 

Less efficient algorithms have a computational complexity that is a polynomial 
function of n. Recall that a quadratic function grows by a factor of 4 when n doubles, 
a cubic grows by a factor of 8, and an £th order function grows by a factor of 2 k when 
n doubles. With the speed of modern computers, especially parallel supercomputers, 
algorithms with complexity n 4 or n 5 can solve moderately large problems — e.g., 
n = 500. Recall that computational complexity functions look at the worst-case 
behavior. Algorithms that have a worst-case complexity of n 4 might be much faster 
for typical problems. 

A much worse type of computational complexity is an exponential function such 
as 2" or «!. For a complexity of n\, simply increasing n by 1, say, from 49 to 50, 
increases the computation effort by a factor of 50. All known algorithms for the 
Traveling Salesperson Problem have exponential complexity. Computer science the- 
oreticians try to determine lower bounds on the computational complexity of solving 
a certain problem. Of particular interest is the question of whether polynomial algo- 
rithms can exist for very hard problems such as the Traveling Salesperson Problem. 
Cook developed a useful framework for discussing hard problems. He took as his 
starting hard problem the satisfiability problem, namely, determining whether a log- 
ical proposition in n variables is valid. No faster algorithm is known than evaluating 
a logical expression for all possible 2" values of True or False for each variable (see 
Section A. 1 for background on logical propositions). Then Cook looked at modeling 
the satisfiability problem in terms of other hard problems, and conversely modeling 
other hard problems as a satisfiability problem. 

A problem is called NP-complete if the problem can model the satisfiability 
problem and can be modeled by it. The modeling effort is required to take polynomial 
time. These models are usually very complicated. A comparatively simple example 
of this modeling process is presented in Example 1, below which shows how to 
model the problem of deciding whether a graph is vertex 3-colorable as a satisfiability 
problem. For a more complete treatment of NP-completeness models, see any analysis 
of algorithms text. We note that the formal definition of NP-completeness is more 
complicated, but our definition here is equivalent. 
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To be NP-complete, a problem has to be posed as a question with a yes or no 
answer. For example, the question of whether an arbitrary n -vertex graph can be 
colored with three colors is NP-complete, as is the question of whether a graph can 
be colored with any specific number of colors greater than two. To find the chromatic 
number of an /(-vertex graph (the minimum number of colors), one does a binary 
search by asking whether the graph is n/2-colorable; if so, whether it is /i/4 colorable; 
if not, whether it is 3/z/8 colorable, etc. — homing in on the smallest number of colors 
that can color the graph. This strategy requires log 2 n coloring questions. 

Observe that any NP-complete problem can model any other NP-complete prob- 
lem, via the intermediate model as a satisfiability problem. The consequence of this 
equivalence is that if an algorithm of polynomial complexity is found for solving 
any one of these NP-complete problems, then it can be used to provide a polynomial 
algorithm for all NP-complete problems. 

It is generally believed that no polynomial algorithm will ever be found for an 
NP-complete problem. Thousands of very hard problems have been shown to be NP- 
complete. Among the graph problems known to be NP-complete are the existence of a 
Hamilton circuit or Hamilton path, graph colorability, and the size of the largest com- 
plete subgraph. It is still not known if determining whether two graphs are isomorphic 
is an NP-complete problem. 


Example 1: Modeling Graph 3-Coloring as a Satisfiability Problem 

Reformulate the problem of deciding whether an //-vertex graph G can be 3-colored 
as a satisfiability problem. We will create a collection of simple disjunctions (proposi- 
tions using just logical “or”). The satisfiability problem will be determining whether 
there is some choice of the values (True or False) of the logical variables that makes 
all the disjunctions true. The number of values needs to be a polynomial function of 
n. In this case, we will use 3 n variables. 

The choices we have in graph coloring are which color the z'th vertex should be 
as follows: is it color 1 or color 2 or color 3? We define three logical variables for 
each of these possibilities: X/j, x,-. 2 , x ,,3 with x, ; = True if the z'th vertex has color j 
and = False otherwise. 

First we develop disjunctions to represent the constraint that each vertex has 
exactly one color, that is, exactly one of x,-_ i , x,. 2 , x ,,3 is true. The following compound 
propositions express this constraint. 

(Xj,l A X/ 2 A Xj j3 ) V (X/ i A X/ 2 A X/ 3 ) V (x U A X ;>2 A X;, 3 ), 

i =1,2 , ,n 

By repeatedly applying the distributive law for disjunction and eliminating redundant 
terms, we obtain the following a set of disjunctions that are equivalent to the previous 
compound propositions. 


Xi, 1 VXi, 2, X;,1 V Xi, 3, X;, 2 V X;, 3 , 

i = 1, 2 , . . . , n 


Xi,l Vx,-, 2 VXj, 3 , 


( 1 ) 


A.5 Computational Complexity and NP -completeness 433 


Next we need to express the coloring constraint that for each edge (v, , \>j), of G, v, 
and Vj must not have the same color — that is, at least one of v,- and Vj does not have 
color k. 

Xi,kVXj'k, k= 1,2,3, (Vj, Vj) is an edge in G, (2) 

Satisfying the collection of simple disjunctive propositions in ( 1) and (2) is the desired 
propositional formulation of the question of whether G can be 3-colored. ■ 
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/--arrangement is an arrangement with r objects. An arrangement may or may 
not allow repetition of objects. There are Pin , r) / -arrangements without rep- 
etition of r objects chosen from n objects. There are n r /--arrangements with 
repetition of r objects chosen from n types of objects. 

Binomial coefficient: A binomial coefficient is a coefficient in the polynomial 
expansion of a binomial expression such as ( a + x)". The coefficient of x r in 
(1 + x)" is written C(n, r) or ("). This coefficient equals the number of distinct 
/--subsets of an n- set. 

Bipartite graph: A bipartite graph G — ( X , Y, E ) is a graph whose vertices are 
partitioned into two vertex sets, X and T, and every edge in G joins a vertex in X 
with a vertex in Y. 

Chromatic number: The chromatic number of a graph is the smallest number of 
colors that can be used in a coloring of a graph. See Coloring a graph. 

Chromatic polynomial: A polynomial Pk(G) that tells how many ways there are 
to A' -color the vertices of a graph G. 

Circuit: A circuit is a sequence of vertices (xi, X 2 , x^, ... , x n ) where xi = x „ , and 
x, is adjacent to x 1+ i . A vertex may not appear more than once in a circuit (except 
for the same vertex as the starting and ending vertex). 

Coloring a graph: A coloring of a graph G assigns a color to each vertex so that 
adjacent vertices have different colors. One can also color edges so that edges 
with a common end/vertex have different colors. 

Combination: A combination is a subset of objects or, equivalently, an unordered 
collection of objects. Objects may or may not be repeated in a combination. There 
are C(n, r) different combinations without repetition of r objects chosen from 
n objects. There are C(n + r — 1, r) different combinations with repetition of r 
objects chosen from n types of objects. 

Complete graph K n and complete bipartite graph K m y. K„ is a graph on n 
vertices with an edge joining every pair of vertices. K$ is a triangle. Ki is an 
edge. K m n is a bipartite graph with /// and n vertices in its two vertex sets and all 
possible edges between vertices in the two sets. 
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Complementary graph or complement: The complementary graph G = ( V , E ) 
of a graph G = ( V , E ) has the same vertex set V as G does. A pair of vertices are 
joined by an edge in G if and only if they are not joined by an edge in G. 

Component: An unconnected graph G consists of a collection of components or 
“connected pieces.” A connected graph consists of a single component. Formally, 
a component of G consists of some particular vertex x and all vertices reachable 
from x by a path in G. 

Connected graph: A graph is connected if there is a path joining any given pair 
of vertices. A directed graph is connected if it is connected when treated as an 
undirected graph (with all edge directions ignored). 

Cycle: A cycle is a sequence of consecutively linked edges whose starting vertex 
is the ending vertex and in which no edge can appear more than once. Unlike a 
circuit, a vertex can be visited any number of times in a cycle. 

Derangement: A derangement of a given arrangement of distinct objects is a re- 
arrangement such that no object is in the same position it had in the original 
arrangement. 

Directed graph, directed edge: A graph is a directed graph if each edge (a, ~b) is 
directed, going from a to b. A directed graph may contain two oppositely directed 
edges joining two vertices, such as (a^b) and (bba). 

Distribution: A distribution is an assignment of a given set of objects, which may be 
identical or distinct, to a set of distinct destinations. Unless explicitly prohibited, 
more than one object may go to the same destination. 

Edge cover: An edge cover is a set S of vertices such that every edge in any graph 
is incident to one vertex in S. 

Euler cycle, Euler trail: An Euler cycle (trail) is a cycle (trail) that contains all the 
edges in a graph. Furthermore, it must visit each vertex at least once. 

Generating function: A generation function g(x) for a n , the number of ways to 
do some procedure with n objects, is a polynomial or power series with the 
expansion g(x) = ao + a\x + a 2 X 2 H — ■ + a n x n + • ■ ■ . Such a function is also 
called an ordinary generating function in contrast to an exponential generating 

2 n 

function g(x), which has the form g(x) = ao + fli + «2 fr + ■ — h a„ ^ + • ■ ■ . 

Graph: A graph G = (V, E) consists of a finite set V of vertices and a finite set 
E of edges. Each edge e — (a, b) joins two different vertices a, b (a f h). Also, 
two edges cannot join the same pair of vertices. Unless G is a directed graph 
(see Directed graph), ( a , b) and (b, a) are the same edge (order does not matter). 

Hamilton circuit, Hamilton path: A Hamilton circuit (path) is a circuit (path) that 
contains every vertex of a graph. 

Independent set: A set of vertices in a graph is independent if no pair of them are 
adjacent. 

Isomorphism, isomorphic graphs: Two graphs G\ and G 2 are isomorphic if there 
exists a matching, called an isomorphism, of vertices in G 1 with the vertices in 
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Gi so that two vertices in G i are adjacent if and only if the corresponding two 
vertices in G 2 are adjacent. Informally, two graphs are isomorphic if they are 
“the same graph” except that their vertices have different names. 

Map coloring: A map of countries is properly colored by assigning a color to each 
country so that countries with a common border get different colors. 

Matching: A matching in a bipartite graph G = ( X , 7, E) is a subset of edges that 
pair off, in a one-to-one fashion, some vertices in X with some vertices in Y. 

Multigraph: A multigraph is a generalized graph in which (1) multiple edges are 
allowed — two or more edges can join the same two vertices; and (2) loops are 
allowed — edges of the form ( a , a). 

Network: A network is a graph, usually a directed graph, with a positive integer 
k{e) assigned to each edge e of the graph. 

Network flow: A flow is a function on the edges of a network that satisfies certain 
constraints listed at the start of Section 4.3. 

Partition: A partition of a collection of identical objects divides the objects into 
a collection of groups of various sizes. One can also speak of a partition of an 
integer n as a collection of positive integers that sum to n. 

Path: A path is a sequence of vertices (x\, X 2 , x$, . . . , x n ) such that x, is adjacent to 
Xj + 1 . A vertex may not appear more than once in a path, except possibly x\ = x„. 

Permutation: A permutation of a set or sequence of objects is an arrangement of 
the set or sequence, normally with no repetition allowed. An /--permutation is an 
/--arrangement of r objects chosen from the set or sequence. 

Planar graph: A graph is planar if there exists a way to draw it on a sheet of paper 
so that no edges cross. A plane graph is a planar graph drawn so that no edges 
cross. 

Recurrence relation: A recurrence relation is an equation such as a„ = 2a n _ 1 + 
3a n -2, in which a„, the number of ways to do some procedure with k objects, is 
expressed in terms of other af s, where k < n. 

Selection: A selection is an unordered collection of objects. Objects may or may 
not be repeated in a selection. There are C(n,k ) different selections without 
repetition of k objects chosen from n objects. There are C(zz + k — 1 , k) different 
selections with repetition of k objects chosen from n types of objects. 

Subgraph: A subgraph is a graph that is contained in another graph. If G' — ( V E') 
is a subgraph of G — (V, E), then f'cy and E' C E. 

Trail: A trail is a sequence of consecutively linked edges in which no edge can 
appear more than once. Unlike a path, a vertex can be visited any number of 
times in a trail. 

Tree: A tree is a graph with a special vertex called a root and for each vertex x, 
other than the root, there is a unique path from the root to x. An undirected tree 
is characterized as a connected graph with no circuits. For tree-related terms, see 
the following Subglossary of Tree Terminology. 
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Subglossary of Tree Terminology 

Ancestors: Ancestors of vertex x are the set of vertices on the path from the root 
to x. 

Backtracking search: See Depth-first search. 

Balanced tree: A tree with all leaves at level h and h — 1, where h is the height of 
the tree. 

Binary tree: A tree in which each internal vertex has exactly two children. 

Breath-first search: From the root, find all vertices z adjacent to the root, then all 
vertices adjacent to one of the zs, and so on. 

Child: Children of vertex x are vertices y with an edge (x7y) from x to y. 

Depth-first search: Also known as backtracking search, builds a path from the root 
as far as possible; one backtracks from the current vertex when the path cannot 
be extended to a previously unvisited vertex and backs up the current path until 
a vertex is found at which a side path may be constructed to a new vertex. 

Descendant: Descendants of vertex x are the set of vertices z whose path from the 
root to z contains x. 

Height: The largest level number in a tree. 

Inorder traversal: In a binary tree, starting from the root, this traversal recursively 
lists the vertices of the left subtree of a particular vertex x, then x, and then the 
vertices in the right subtree of x. 

Internal vertex: A vertex with children; internal vertex and parent are equivalent 
terms. 

Leaf: A vertex with no children; a leaf’s only incident edge comes in from its parent. 

Level or level number: The length of the path from the root to a given vertex — e.g., 
level 2 of a tree consists of all vertices whose path from the root has length 2. 

m - ary tree: A tree in which all internal vertices have m children. 

Parent: The parent of vertex x is the unique vertex z with an edge (zf x) from z to x. 

Postorder traversal: Lists vertices in the order they are last encountered in a depth- 
first search of a tree. 

Preorder traversal: Lists vertices in the order they are first encountered in a depth- 
first search of a tree. 

Root: The special vertex in a tree with a unique path to any other specified vertex; 
in a directed or rooted tree, the root is the unique vertex at level 0 that has no 
parent. 

Rooted tree: A directed tree with all edges directed away from the root. 

Sibling: Siblings of vertex x are those vertices with the same parent as x. 

Spanning tree: A tree that is a subgraph of a connected graph and that contains all 
vertices of the graph. 

Subtree: A connected subgraph of a tree. 
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SOLUTIONS TO 

ODD-NUMBERED 

PROBLEMS 



1. R Bu G Y. 

3. YBuBu W. 

5. Bu Bk Bk Bu or Bk Bu Bu Bk. 

7. G Y R Bk. 

9. WWR Bu. W Bu R Bu, W Bu R W, Bk Bk R Bu, Bu Bk R Bu, Bu Bu Bk R. 
11. ORYBuP. 

13. (a) Three black and one white, (b) 14. 

15. 9. 

17. 1040. 

19. Two of one color and one of a second color. 


CHAPTER ONE SOLUTIONS 

Section 1.1 

1. (a) 


F 



(b) 2 (C, D), (C, E), 

(c) Yes, several routes. 

3. (a) A 5-circuit. 

5. (a) Many possibilities, 

(b) min = 4, several possibilities. 

7. (a) { A-a , B-b, C-d, D-c}. 

(b) A and C each can fill only job b. 


441 


442 Solutions to Odd-Numbered Problems 


9. (a) No possible (odd number of vertices), 

(b) a,c, e collectively must be matched with just b and e. 

13. (a) Vertex = variety of chipmunk, if A splits into B and C, then make edges 
(AT B) and (AfC), 

(b) 7 splits. 

15. (a) 4 other pairs. (b) {b, j), {c,h} and 6 other pairs. 

17. C, E,c,d (answer for both parts of the question). 

19. Minimal block surveillance — 6, minimal corner surveillance — 3. 

21. (a) 5: squares (2, 4), (3, 4), (4, 4), (5, 4), (8, 4), (b) 8. 

23. (a) (i) {b, c], (ii) {A, a, B, b, D, ej, 

(b) (i) {a, 4}, (ii) {C,c,d, Ej. 

25. (a) 6, K i 6 (b) 7, isolated vertices. 

29. (a) (b) not 



B A 


/\\ 

ABA 

/\\ 

ABA 

A\ 

ABA 


Section 1.2 

1 . 


U Plus complements 
of first 4 graphs 

5. (a) No, right graph only has circuits of length 5, 

(b) Yes, a- 6, b- 1, c-3, d- 5, e-2, f- 4, 

(c) Yes, a- 3, b- 4, c-7, 4-1, e-6, /- 8, g-2, /?-5, i-9, 

(d) No, degree-2 vertex is part of triangle only in right graph, 

(e) No, right graph has one more edge, 

(f) No, subgraphs of vertices of degree 3 do not match, 

(g) Yes, a-1, b- 3, c-5, 4-4, e-1, f- 2, g- 6, 

(h) No, degree-3 vertices are adjacent only in left graph, 

(i) Yes, a-1, b- 3, c-5, 4-2, e-4, /-6, 

(j) No, degree-2 vertices are adjacent only in left graph, 

(k) Yes, a-1, b- 8, c-4, 4-7, e-6, f- 9, g- 11, h-10, i-2, j- 3, rt-5, 

(1) Yes, a-1, b- 2, c-3, etc. 

7. Graphs 1-6, 13-18, 31-36, 37-42 mutually isomorphic, 7-12, 19-24, 25-30 
isomorphic. 

9. No, building on the isomorphism in Example 2, b — > c but 5 -> 2. 

13. 3 in each. 
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Section 1.3 

1. (a) 12, (b) 9, (c) 8 or 10 or 20 or 40. 

3. 12. 

5. Solve for n in terms of m in the formula m = n(n — l)/2. 

7. If v vertices, then e = \ vp edges (where \v is an integer since p is odd). 

9. Sum of in-degrees (or out-degrees) = number of edges, since sum counts each 
edge once. 

13. (a) No, (b) Yes, (c) No. 

Section 1.4 

1. (a) (b) 



3. (a) No, delete ( b , c), ( e , /): {a, e, /}, {b, c, dj, 

(b) No, delete (b, j), (e, g ): {a, d, h], {c, /, i), 

(c) No, many possible ^3 3 , 

(d) No, delete (a, b), ( b , c), (d, e), (/, g ): {a, d , e}, {c, /, g}, 

(e) Yes, (f) No, delete a and ( b , c), (g) Yes 

(h) No, delete (e) : {a, b, c, d, /} 

(i) No, delete ( d , e): {a, c, e}, {b, /, g), 

(j) Delete (/, g): {a, c, e}, { b , c/, /;}, 

5. (a) rc < 4, (b) r or s < 2. 

7. (a) Possible, e = 11, 

(b) Possible, r = 7, 

(c) Not possible, 

(d) Possible, v = 7, 

(e) Possible, e = 12, r = 8 , 

(f) Possible, v = 7, 

(g) Possible, e = 12, v = 8 , 

(h) Not possible (parity violation), 

(i) Possible, e = 20, r = 10, 

(j) Not possible. 

9. (a) Degree of vertices in K 5 is 4 => degree of vertices in L(K$) is 2 x (4 — 1) = 6 ; 
L(K S ) has v = 10 and e = | J]deg = 3v = 30 => e 3v — 6 ; 
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(b) 


11. (a) Circuit length = sum of number of boundary edges of R\ and AS — 2; 

(b) Circuit length = sum of number of boundary edges of each enclosed region 
minus 2 x (number of edges interior to circuit). 

13. (a) Al 3 3 and K$ are critical nonplanar. 

15. (a) r = e — v + c+1, 

(b) Using (a), corollary becomes e < 3v — 3c — 3 (< 3v — 6). 

19. If false, deg >5 and so 5v < sum of degrees = 2e < 2(3v — 6) = 6v — 12, that 
is, 5v < 6v — 12 or 12 < v — impossible. 

25. v = p + 21, e = — 2p + 31, answer = r— l = e — v+l=p + /+l. 



Supplement 

1. Vertex = committee, edge = committee overlap (person). Graph is Ki with 
e = iX>g= j(6v) = 21. 

3. n = 12. 

7. Yes, each component of G is a circuit. 

9. (a) Yes, trace out any sequence of edges and eventually a vertex z will be repeated, 
between first and second visit to z a circuit is formed, 

(b) No — e.g., 



11. Vertices in different components of G are directly adjacent in G; vertices in 
same component are joined in G by a path of length 2 via any vertex in other 
component of G . 

15. If: suppose not strongly connected with no path from a to b — let V) consist 
of a and all vertices that can be reached by a directed path from a, V 2 is other 
vertices, only if: obvious. 

17. A bridge edge cannot lie on a circuit. 

19. (a) Yes, see Exer. 7 in Section 1.2, 

(b) No, odd number of vertices of odd degree, 

(c) No; e = ^^deg = 13 =>■ e ^3v — 6. 

21. Possible, a K\ 5 with two extra edges. 

23. Consider a vertex y that beat x, [i.e., (yj* x)]; if y beat every competitor that beat 
x, then y would have a greater score that x — not possible — and so for some w, 
(x 7 w) and (wry). 

25. (a) Repeatedly remove side circuits until trail from x to y has no repeated vertices. 
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(c) Similar argument to part (a). 

(d) If odd number of edges of cycle are partitioned into circuits [part (d)], then 
some circuit must have odd number of edges. 

27. (a) (h, a), [h, g). 

29. Two-component, « -vertex graph with the most edges is a K n ~\ plus an isolated 
vertex; it has \{n— 1 ){n — 2) edges. 

33. (a) 5-circuit or 3-edge path. 

(b) By hint, if G has n vertices, number of edges in G = | (number of edges in 
K„) = f [\n{n — 1)] = \n{n — 1). Since n and n — 1 are not each divisible 
by 2, then either n or n — 1 must be divisible by 4, i.e., n — 4k or n = 4k + 1 . 
35. If: obvious, only if: let x„ be vertex with 0 out-degree (if no such vertex, there 
is a directed circuit — Exer. 9(a), move x„ from graph and let x„_i be vertex with 
0 out-degree in remaining graph, continue indexing in this fashion. 

37. (a) 


ab 


be 

(b) Each edge of K n is incident to n — 2 other edges at each end vertex, 2(/i — 2) 
incidences in all, 

(c) Can be no vertices of degree 0 or 1 and so all degrees >2; since e = i^deg, 
then to have e = v (so that G and L(G) have same number of vertices), all 
deg = 2 =>■ G is any circuit. 

39. (a) b -o- c, a and d fixed, 

(b) None, 

(c) a^c, e -o- /, b and d fixed. 

41. Sketch of proof: show that each vertex in (C i U Cf) — (C i (T Ct) has even degree 
and then repeatedly trace a path (without repeating any edge) until a vertex x is 
revisited and remove the circuit formed between the first and second visit to x. 



CHAPTER TWO SOLUTIONS 
Section 2.1 

1. (a) Many possibilities, 

(b) Many possibilities with b and / as end vertices. 

3. One example is a graph consisting of a circuit of 7 edges. 

5. (a) No, once bridge crossed there is no way back to starting vertex, 

(b) Many possibilities, e.g., a 10-edge path. 

7. An isolated vertex added to a connected graph with even degrees now has a Euler 
cycle but is not connected. 
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9. Build a graph with a vertex for each racer and an edge for each race; a Euler trail 
corresponds to a sequence of races in which each racer is in two consecutive races; 
this graph has the desired Euler trail because only A and F have odd degree. 

11. A set of deadheading edges must have one edge at each odd-degree vertex; joining 
these edges at odd-degree vertices together (without changing the parity of the 
degree of other vertices) requires a set of paths. 

13. A directed graph has a Euler trail if and only if at all but two vertices indegree = 
outdegree and at those two, indegree and outdegree differ by 1 . Proof: add an 
extra edge so that indegree = outdegree at two unbalanced vertices and resulting 
graph has Euler cycle; remove added edge yielding desired Euler trail. 

15. No such Euler cycle containing all vertices. 

17. (a) Many possibilities, 

(b) If at every stage graph of remaining edges is connected, then just before 
using last edge E and being forced to stop at starting point, E is only edge 
remaining and once taken there are no remaining edges, 

(c) Applies to Euler trails. 

19. (a) oi 



(b) Concatenating the first digits of each node on an Euler cycle produces desired 
sequence, 

(c) Many possibilities, e.g., cycle 00-00-01-11-11-10-01 (-00) produces 
00011101 . 

Section 2.2 

1. Many possibilities — e.g., 

(a) , (b) 



3. a-g-c-b-f-e-i-k-h-d-j-a. 

5. Path: m-b-c-d-e-q-a-l-k-j-o-p-n-f-g-h-i, no circuit: by symmetry at p, delete any 
edge, choose p-m, at m and c rule 1 forces subcircuit m-b-c-d-m. 

7. (a) Rule 1 at /, h, j forces subcircuit e-f-g-h-i-j-e, 

(b) By symmetry at p, use m-p-n and delete p-o forcing k-o-i, at m and n. if both 
use edge going up (to g) then subcircuit p-m-g-n-p, if both m, n use edge 
down then subcircuit p-m-k-o-i-n-p, so by symmetry use m-g and n-i, deleted 
i-h, i-c, i-j forcing g-h-b-c-d-j-k, plus k-o-i-n-p-m-g, yielding a subcircuit. 
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9. (a) Hamilton circuit alternates between red and blue vertices and so must have 
equal numbers of each, 

(b) Follows from part (a), 

(c) (i), (ii) Have odd number of vertices, (iii) Has 9 vertices in one part and 7 
vertices in other part of bipartition. 


11. (a) 



Edges between vertices at 
corresponding positions of 
the two cubes 


(b) Many possibilities : 1 -0000, 2-000 1 , 3 -00 1 1 , 4-00 1 0, 5 -0 1 1 0, 6-0 1 1 1 , 7-0 1 0 1 , 
8-0100, 9-1100, 10-1101, 11-1111, 12-1110, 13-1010, 14-1011, 15-1001, 
16-1000 



15. Many solutions but very tedious; the following heuristic works: at each stage, look 
at all possible squares (not yet visited) a knight’s move from current square and 
move to a square with the minimum number of possible squares for the next move. 

17. (a) Rook: starting at upper left comer, go down first column, square by square, 
at bottom of first column move right to bottom of second column, move up 
second column, square by square, continue going down and up successive 
columns finishing at top square of right column from which one moves back 
to top square of left column, king: similar to rook, except in columns 2, 3, 
. . . , n — 1, avoid top square, when top square of right column reached, move 
left along top squares of each column to return to top left square, 

(b) Rook: not possible (variation on reasoning in Exer. 14), king: (1, 1)-(1, 2)- 
(2, 2)-(l, 3) . . . (2, n ), now go left and right covering rows 3 through n 
avoiding left square in each row except row n, finishing at (1, n) and now 
move up first column to starting square. 


19. (a) nl. 


(b) Form the Hamilton circuits by placing vertices in a circle: first circuit formed 
by joining consecutive vertices, second circuit formed by joining vertices 
2 positions apart (one intervening vertex), third circuit formed by joining 
vertices 3 positions apart, and so on (no subcircuits because n prime); 
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(c) Form complete graph with professors as vertices, answer: 8 days, using part 
(b) to form 8 Hamilton circuits out of the edges of K\ 7 . 

21. If x, y nonadjacent, direct all edges inward at both x and y, now x and y can 
only be on a Hamilton path if they are the last vertex on the path — cannot be 
two last vertices. 

23. Many possibilities — e.g., 


Section 2.3 

1. (a) 3, has K 3 , 

(b) 4, an attempt to 3-color outer 6 vertices gives b , d, f different colors so that 
c requires fourth color, 

(c) 3, not bipartite, 

(d) 4, a, b, e, g form a K 4 , 

(e) 3, odd circuit, 

(f) 2 , 

(g) 4, isomorphic to graph in 1(b), 

(h) 4, a, c, g, i form a K 4 , 

(i) 5, a, b, c, d, f form a K$, 

0) 3, 

(k) 2, 

( l ) 2 , 

(m) 4, an attempt to 3-color vertices clockwise around the circle starting at a 
forces j to have same color as a, 

(n) 4, graph contains a 5-wheel (see Example 2), 

(o) 4, graph contains a 5-wheel (see Example 2), 

(P) 3, outer pentagon (odd-length circuit) cannot be 2 -colored, 

(q) 4, an attempt to 3 color the sequence of vertices a, e, b, /, c, g, d forces d to 
have same color as a, 

(r) 4, a 3-coloring forces a and h to have different colors, similarly for i and p 
so that a, h,i, p act like a K 4 . 

3. (a) Many possibilities, (b) Many possibilities, 

5. 00, (g), (p), ( q ). 

7. (a) {a, b, c}, {d, e, f},{g,h], 

(b) {a,b,e},{c,d},{f,g}, 

(c) {a, b, c}, {d, e, /}, {g, h, i), { j , k}. 

9. No, Nevada (or Kentucky or West Virginia) and its neighboring states have duals 
that form odd-length wheels. 

11. 3. 

13. Vertices = classes, edges = two classes with a common student, colors = class 
times. 

15. Vertices = ships, edges = overlapping visit, colors = piers. 
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17. (a) Yes, 

(b) Yes. 

Section 2.4 

1. Proceed as in Theorem 5 using induction and the fact that there is a vertex x of 
degree 5, but since 6 colors are available, x is immediately colored with a color 
different from those used by its 5 neighbors. 

3. The chromatic number of G is the maximum of the chromatic numbers of its 
components. 

5. If the edge chromatic number is 2, the maximum degree must be 2. If the vertex 
chromatic number is 2, then the graph is bipartite. Combining these two facts, 
we see that the graph must be a path or an even-length circuit. 

7. (a) 3-color one triangle and then extend by successively coloring a vertex that 
lies on a triangle with two previously colored vertices, 

(b) Process in part (a) Yields unique coloring. 

9. (a) If not connected, then k colors are needed to color one of G’s components 
and removing a vertex from another component will not reduce x(G), 

(b) If deg(x) < k — 2, then if G — x could be k — 1 colored, also G could be 
k — 1 colored by giving x one of the k — 1 colors not used by one of x’s k — 2 
(or less) neighbors, 

(c) If x disconnects so that G — x has components G’ and G" then one of G’ IJ x 
or G" U x requires k colors and removing a vertex from the other component 
will not reduce x(G). 

11. (a) For n = 1, x(G)+ x ( G ) = 2; assume for n — 1 and consider an n -vertex 
graph G; by induction we can color G — x and G — x, for any given vertex 
x, with a total of n colors for both graphs (possibly fewer); x has a total 
of n — 1 edges in G and G and so is not adjacent to one of the n color 
classes in one of G or G and can be added to that color class, although in 
the other graph x may require an additional color — for a total of n + 1 , as 
required, 

(b) x(G)) > size of largest complete subgraph i n G = q , size of largest indepen- 
dent set in G, and by Exer. 8(a), x(G)<? > n, 

(c) Square both sides of inequality, new inequality follows immediately from (b) 
and the fact that a 2 + b 1 > 2 ab. 

13. ( k 2 — 6k + 8) = 0 when k = 4 and so Pk(G) = 0 for k = 4, but the Four Color 
Theorem says that any planar graph can be 4-colored (i.e., has a positive number 
of 4-colorings). 

15. If: label with numbers that are the length of the longest path starting at the ver- 
tex, adjacent vertices must have different-length longest paths since edge (xT y) 
implies that x’s longest path will be at least one greater that y’s longest path 
length, only if: let colors be numbers 0, 1, 1 and direct edges from larger 

to smaller numbers. 

17. Pick two nonadjacent vertices, each on one of the odd-length circuits, and give 
them a third color. The remaining graph has no odd-length circuits and hence 
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is bipartite (2-colorable). Some details are required to show that the two such 
non-adjacent vertices can be found. 

Supplement 



3. One labeled factor must be 1: W-B, 2: G-R, 3: B-G, 4: R-W, second labeled 
factor can be 1 and 2: G-B, 3: R-R, 4: W-W or 1: B-G, 2: W-B, 3: R-W, 4: G-R 
(or interchange 1 and 2). 

5. (a) Yes, a Hamilton circuit is a factor, 

(b) Having a Euler circuit has no relation to having a factor. 


CHAPTER THREE SOLUTIONS 

Section 3.1 

1- («) „ (b) 

> z 

(C) X NN l\Z l\Z LU X 

3. No odd circuits =>■ bipartite (Theorem 2 of Sect. 1.3) =>■ 2-colorable. 

5. (a) See answer to part (b), 

(b) Since G is connected, a subset of edges can be chosen to form a tree con- 
taining all vertices of G, but this subgraph has one fewer edges that vertices 
(by Theorem 1) and if G contained other edges, besides those in this tree, it 
would have as many vertices as edges, 

(c) If G has a circuit, removal of an edge an circuit does not disconnect G =>■ G 
has no circuits; now use part (a). 

7. Start at any vertex and trace a trail (no repeated edges); since no vertex is ever 
visited twice (if so, circuit would result) and graph is finite, trail must end at a 
vertex x of degree 1 ; now start trail-building again at x to get a second vertex of 
degree 1. 
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9. 


11 . 

13. 

15. 

17. 


Height h =>■ each path to a leaf passes through at most h internal vertices with 
a choice of m children to go to at each internal vertex, for a total of at most m h 
paths to a leaf. 

i In = [by Corollary part (a)] i /( mi + 1) » i /mi = 1 /m. 

Largest n — 1, smallest 2. 

A tree is bipartite, one vertex class in the bipartition must have at least n/2 
vertices. 

Unbalancing a tree: 






makes sum of level numbers larger and so smallest sum occurs when binary tree 
is balanced, in which case each level number is |_log 2 1 J or [log 2 / J + 1, and sum 
of level numbers is at least / |_log 2 1\ ■ 

19. (a) Internal vertices are +s 


(b) riog, 1001 = 7. 

21. 63. 

23. Leaves = letters, each leaf = /(-digit binary sequence =>• 2" letters. 
25. (a) 24, (b) 16, (c) 12, 

(d) 9 tournaments (original and 8 losers’ tournaments). 

log 2 = Llog 2 (« + 1)J - L (b) 


27. (a) 



A B C 


29. (a) 


12 3 T 


AAA 

1 L 3 H 2 L 4 L All 4 H 2 H 3 L 1 H 

fair 


(b) First weighing is either one coin on either side or two coins on either side, 
in either case some outcome has more than 3 possibilities that must be dis- 
tinguished in the one additional weighing (impossible, for example with one 
on each side, there are 5 possibilities if the scales balance). 
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Section 3.2 

1. (a) Any 8-vertex path, 

(b) Many possibilities, e.g., a-b-d-c-h-k-i-g-e-f-j, 

(c) Many possibilities, e.g., a-b-c-d-e-f-g-h ; many possibilities, e.g., any 6-vertex 
path. 

3. All trees on 5 vertices [see solution to Exer. 1(b) in Section 3.1], 

(b) All trees on 4 vertices [see solution to Exer. 1(a) in Section 3.1], 



5. 

7. 

9. 

11 . 


13. 


(d) A path of length 5 on a tree with 3 vertices of level 1 and 2 vertices at level 
2, places of d and c can be interchanged in last tree. 

4 components, xn, x \ 9 , X23 isolated vertices, depth-first spanning tree for other 
component has path X2-X4-X^-X3-Xg-Xi2-X^-XIQ-X3-Xi5-X20-Xi4-Xi^-Xis-X22-X24- 
X 26 -X 13 , plus edges from X20 to X25, from X 14 to X 7 to X21, and from X22 to Xn- 
If the connected graph is not a tree, it has a circuit, which contains an edge e not 
on the one spanning tree; a second spanning tree must exist that contains e. 

(a) If not all vertices reached, some reached vertex would have an edge to an 
unreached vertex, but a depth-first search would use such edge, 

(b) Immediate. 

If C has no edge of a spanning tree T, removal of C could not disconnect graph 
(spanning tree’s edges connected graph). 



15. (0, 0)-(0, 4)-(4, 0)-(4, 4) — now 2 quarts in 10-quart pitcher. 

17. Right-hand-wall rule is same as depth-first search, which takes leftmost branch 
at every intersection (corner). 

19. The ferryman takes goat across and returns alone; next he takes the dog across 
and brings the goat back; next he takes the tin cans across and returns alone, and 
finally he takes the goat across. 

21. Not possible. 

25. A and B cross (2 min.) and A returns (1 min.); next C and D cross (10 min.) and 
B returns (2 min.); finally A and B cross again (2 min.). 
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27 . 


Sin 



a + c e a b 



b c a d 


Section 3.3 

1. Cost is 11: 1-3-2-4-1. 

5. Cost is 14: T 1 -T 5 -T 4 -T 3 -T 2 -T 1 . 

7. (a) 1 -4-3-2, (b) 1-4-5-3-2-6-1, (c) 2-3-5-1-4-2. 

9. 2 1 2 
222 
2 1 2 

Section 3.4 

1. Outcomes are ordered lists. 

3. A binary comparison tree has n ! outcomes or leaves (as noted at the beginning 
of this section); so average number of comparisons = average leaf level = (by 
Exer. 17 in Section 3.1) log 2 /z ! = n log,/;. 

5. If the initial heap is balanced [as decribed in Exer. 4 (a)], then largest level 
number is log 2 /z and the number of comparisons to adjust the heap each time the 
root is removed will equal the largest level number — i.e., log 2 /z; for n iterations 
of removing the root and readjusting the heap, there will be n\og 2 n comparison; 
constructing initial heap is similar. 

9. (b) 0{n\og 2 n). 


CHAPTER FOUR SOLUTIONS 
Section 4.1 

1. Length 14: c-d-h-k-j-m. 

3. (a) 31: L-c-d-f-g-k-W, (b) 32: L-b-h-j-m-W , 

(c) 13: L-a-c-d-f-g-k-W, (d) L-c-d-f-g-k-W. 4 paths. 
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5. (a) 5 : L-b-h-j-m-W, 

(b) 6: L-c-d-f-g-k-W, 

(c) 6: L-c-e-g- j-m-W . 

7. If algorithm has found shortest paths to all vertices <m unit from a, then a vertex 
x will be distance m + 1 from a if and only if the length k(y, x ) from a vertex y 
(y is closer to a than x) plus the distance d(y) of y from a equals m + 1 — this is 
exactly the test that the algorithm performs. 

9. Many possible examples — e.g., a directed circuit from a to b to c to a with two 
edges of length 1 and the third edge of length— 3. 

11. Define a tree by letting the first label of a vertex be its parent. 


Section 4.2 

1. (a) 59: path L-a-c-d-h- f-g-i-k-W plus edges ( b , d), ( d , e), (g, j), ( k , l), ( k , m), 

(b) 60: replace ( k , W) by ( m , W), 

(c) L , j (other possibilities), 

(d) Modify part (b) by deleting (a, c), (g, i), and adding (a, b), ( g , k). 

3. (a, b) 39: path N-b-c-d-e-g- j-m-R plus edges ( d , h ), ( e , /), (/, i), (j, k). 

5. 


7. Modification: let initial edge be prescribed edge. 

11. (a) If all edges of shortest length do not form circuit and they are not all in T' , then 
add an omitted one and remove a longer edge in the resulting tree (as in proof 
of Prim’s algorithm) to obtain a shorter minimal spanning tree — impossible. 

(b) Same reasoning as in part (a), 

(c) If: part (b) is property of minimal spanning tree used to prove validity of 
Prim’s algorithm; only if: verified in part (b). 


Section 4.3 

1. max flow = 13, P — [a, b, c}. 

3. max flow = 50 , P — {/, z). 

5. (a) max flow = 13, P = { a , b, c), 
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(c) 


2 f 7 k 3 P 8 


i / 

7 8 

5 

14 n 

4 

' 10 « 

5 

11 ' 




/ 


12 

8 

16 

' 12 h 

4 m \ 

'10 r' 

' , 1 

\ 



i 


4 

14 

7 

7 i ' 

' 8 n ' 

' 10 s’ 

' 10 


j 




5 

4 

7 

' 3 7 ' 

1 9 

13 ' 

' 6 


7. Set capacities of edges out of a and into z equal to 100 (equivalent to unlimited 
flow), max flow = 150, P = {a, b, c, d}. 

9. 5, build paths by choosing leftmost unused edge leaving each vertex. 

11. (a) 3 (3 edges leaving L), 

(b) 15, quintuple flow in part (a). 

13. See Exer. 12 for modeling the vertex capacity constraint; max flow = 40, P — 

\.fh fo, 8oi Z . } ■ 

15. max flow = 2100, route 400 on ao-d2-Z3-Z4, 300 on CI0-C2-Z4, 400 on ao~b\-z?,-Z4, 
200 on ao-b\-C2-di-Z4, 400 on ciq, cii-d?,-Z4, 400 on ao-ci\-b2-Z4- 

17. max flow = 36. 

19. (a) Algorithm tries to reduce flow in incoming edge before adding flow to out- 
going edge, 

(b) Yes. 

21. Define tree with a vertex’s parent being its first label. 

23. (a) Put flow of 1 in the two edges between c and e; other edges no flow, 

(b) Take flow in (a); and add a 2-unit flow path a-d-f-z. 

27. If each flow path crosses a cut once, then the capacity of the cut = sum of values 
of such flow paths = value of flow. 

29. (a) This is max flow-min cut theorem for model in Example 6, 

(b) Make edge capacities 1 and build vertex constraints of 1 (see Exer. 12), result 
is max flow-min cut theorem for this network. 

31. One needs to build an initial feasible flow (requires at least one edge entering 
and one edge leaving each vertex, except a and z), then use same algorithm to 
reduce, instead of increase flow along a flow path. 

33. (a) In step 2a of augmenting flow algorithm, incoming flow cannot be reduced 
below its lower bound value; otherwise use same algorithm. 


456 Solutions to Odd-Numbered Problems 


Section 4.4 

1. (a) Several possibilities, see (b), 

(b) Using pairings A-G, Lo-J , first labels are (except for a, all second labels are 
1): Bo-a+, F-Bo+, G-Bo+, C-F~, A-G~, J-C+, Bi-A+, Lo-J D-Bi~, 
La-Lo + , z-La + , new matching A-G, D-Bi, Lo-La, Bo-F, C-J . 

3. Give edges from a and into z the appropriate supplies and demands. Middle edges 
still oo, one solution: A-Bi (3 dates), D-Bi (2), Bo-F (1), Lo-F (2), C-F (1), 
A-G (1), Bo-G (2), C-J (3), Lo-J (2), D-La (3). 

5. No, schools with demand of 7 Ph.D.s can hire at most 6 (one from each university). 

7. (a) Can be co-champions with either Lions or Tigers, 

(b) Not possible; Lions and Tigers must play each other 3 times, but each team 
can only win once. 

9. Only possible to be co-champions, jointly with the other three teams. 

11. (a, b) Make complete bipartite graph, each vertex of degree n; by Example 3 
there is a pairing for the first night; remove these edges; now again by Example 
3, there is a pairing for the second night; continue in this fashion. 

13. Start with a standard set-of-distinct-representatives matching network; replace 
the source a with 3 sources (one for each university), each with capacity m/3 and 
unit-capacity edges to each university’s graduates. 

15. Necessary and sufficient condition: for any set S of vertices, |5| < |s(S)|, where 
s(S) is the set of successors of vertices in S (vertices with an edge coming in from 
a vertex in ,S'); this condition guarantees A complete matching in hinted bipartite 
graph, which corresponds to a set of edges in original graph with one edge out of 
and one edge into each vertex. 

17. For each left-side vertex, x, form a set of x’s (right-side) neighbors. 


Section 4.5 


1 . 

3. 

5. 

7. 


(a) xn — 30, X21 = 20, x 2 2 — 10, (b) x n = 20, x i2 = 10, X22 — 30. 

(a) xn — 30, X22 = 30, X31 = 10, X32 = 20, 

(b) X12 = 30, X21 = 10, X22 = 20, X31 = 30. 

(a) X13 = 30, X21 = 20, X23 = 10, X32 = 20, X33 = 10, 

(b) X12 = 20, X13 = 10, X21 = 20, X23 = 10, X33 = 30. 

(a) xn = 30, X23 = 30, X32 = 10, X33 = 20, X42 = 30, 

(b) X13 = 30, X21 = 30, X31 = 10, X33 = 20, X42 = 30. 


CHAPTER FIVE SOLUTIONS 
Section 5.1 

1. (a) 12, 20, (b) 3, 6. 

3. 8 x 12x5x4. 

5. 26 s , 26 x 25 x 24 x 23 x 22 x 21. 
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7. (a) 10 + 6 + 4, 

(b) 10x6x4, 

(c) 3 x {10x9 x(6 + 4) + 6x 5 x(10 + 4) + 4x 3 x (10 + 6)}. 

9. (a) 4 x47= 188, 

(b) (1 x 48) + (12x47). 

11 . 12 . 

13. See Supplement at end of Chapter 5. 

15. 52 x 48/52 x 51. 

17. (a) 10 x 9 x 8 x 8, 10 x 9 x 8"~ 2 , 

(b) 9 x 8 x 1/9 x 8 x 8, 

(c) 9 x 8 x 7 x 1/9 x 9 x 8 x 8. 

19. (6 x 5 x 4) - (5 x 4 x 3). 

21. (6 x 5 x 4) - (4 x 3 x 2) or (2 x 3 x 4 x 3) + (3 x 2 x 4). 

23. (a) 9 x 9 x 8 x 7, 

(b) 7 x 8 x 8 x 8, 

(c) 9x9x8x7 — 7 x 7 x 6 x 5. 

25. 10 x 9 x (2 5 — 2)/2 

27. 4 5 — 3 x 4 2 
29. 15 x 10(14 + 9). 

31. 4 x 10 3 . 

33. See Supplement at end of Chapter 5. 

35. 2 x 25/50 x 49. 

37. [(3 x 3 x 2) + (3 x 3 !)]/6 3 — pick the possible smallest value (3 choices), count 
ways of picking middle value and then arranging them (3x2 possibilities when 
middle value same as largest or smallest). 

39. 9 x 7 x 5 x 3. 

41. 16 x 8 x 7/2, n x m x (m + n — 2)/2. 

43. {[28 x (64 — 22) (Queen on edge of board)] + [20 x (64 — 24) (Queen one away 
from edge of board)] + [12 x (64 - 26)] + [4 x (64 - 28)]}/2, 

45. 2 10 — 1 — each friend is or is not the subset (subtract 1 for the case of no one 
invited). 

47. 2x3x6. 


Section 5.2 

1. 52! 

3. P(13, 6). 

5. P(7, 4). 

7. C(14, 5), C(12, 3). 

9. {C(ll, 9) + C(ll, 10) + C(11, ll))/2 n 

11. (a) C(12, 2) x C(10, 2) x C(8, 2) x C(6, 2) x C(4, 2), (b) 6! x 6!. 

13. 7 x C(6, 2) x 24 4 . 

15. C(n, 8)2" -8 . 

17. C(n,k)x 2 n ~ k /3 n . 

19. C(10, m) x C(10 — m, n) x 24 10 -'”-" . 
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21. (a) C(4, 2) x {C( 6, 4) + C(6, 5) + C(6, 6)} + [C(4, 3) x C(6, 6)], 

(b) C(9,3) + C(9,4) + C(9,5), 

(c) C(10, 5) — C(7, 2), 

(d) {[<7(4, 2) x C(6, 2)] - (3 x 5)} + {[<7(4, 1) x C(6, 3)] - C(5, 2)} + C(6, 4). 
23. (a) {C(4, 2) x 9 2 + C(4, 3) x 9 + C(4, 4)}/10 4 , 

(b) C(10, 2) x (2 4 - 2)/10 4 , 

(c) C(10, 4)/10 4 . 

25. 3! x 6! x 8! x 5!. 

27. C(21, 5) x 10!; 2 x 5! 2 /10!. 


29. 

(a) 

2 x 5!/6! = 1/3, (b) 

C( 6, 2) 

x 4!/6! 


31. 

(a) 

C(n, 3), 





(b) 

C(n — m, 3) + [m x C(n - 

- m, 2)] 



33. 

12. 





35. 

(a) 

1-vote 1/6, 2-vote 1/2, 


(b) 1-vote 

1/6, 2-vote 1/3, 


(c) 

1-vote 468/7!, 2-vote 1056/7!, 

(d) 1 -vote 

396/7!, 2-vote 864/7! 

37. 

(a) 

26!/2, (b) 26!/2 2 , 





(c) 

C(26, 5) x 21!. 




39. 

C(f 

i, 3) x P{6, 3) x P(21, 3) - 

fC(5,4)xF(6,4)x 

P(21, 2) + 21 x 6!. 

41. 

4 x 

3! x C(7, 2) x C(5, 2) x ; 

1! 




43. C(30, 5) 12 /C(360, 60). 

45. (a) {[(3 x 4) + (2 x 5)] x 8!}/ 10!, 

(b) {(1 x9x 8!) + (8 x 8 x 8!)}/10!. 

47. (a) {[<7(4, l) 4 x C(36, 1)] + [4 x C(4, 2) x C(4, 1) 3 ]}/C(52, 5), 

(b) {C(26, 5) + <7(13, 1) 2 C(26, 3) + <7(13, 2) 2 x C(26, 1)}/C(52, 5). 

49. See Supplement at end of Chapter 5. 

51. {C(30, 2) x C(28, 2) x C(26, 2) x C(24, 2)}. 

53. 4 x 3 x 8!/2!/2!. 

55. (a) See Supplement at end of Chapter 5. 

57. C(64, 8) x C(56, 8). 

59. See Supplement at end of Chapter 5. 

61. C(6, 3) x C(4, 2) + 2 x [C(6, 2) x C(4, 2) + C(6, 3) x C(4, 1)] + [6 x 
C(4, 2) + C(6, 2) x C(4, 1) + C(6, 3)] 

63. See Supplement at end of Chapter 5. 

65. (a) C(10, 2) x C(k - 10, 18)/C(£, 20), 

(b) k = 100. 

67. C(12, 5) x C(4, 2) x 2 x C(7, 2) x C(5, 2) x C(3, 2). 

69. See Supplement at end of Chapter 5. 

71. (a) C(45, 3) + C(45, 2) x C(45, 1), 

(b) [3 x C(30, 3) (three integers with same value mod 3)] + C(30, l) 3 (each 
integer with different value mod 3), 

(c) as in (b), break into cases based on the value mod 4: C(22, 3) + [C(23, 2) x 
C(22, 1)] + [2 x C(23, 2) x C(45, 1)] + [C(23, 1) x C(22, 2)] + 22 2 x 23. 

73. C(1 1, 6) x C(5, 2) x 5!/2!. 

75. / > (20, 3) 6 , 

77. P(C{ 8, 2), 12), 

79. 8!/2!2! + (8!/2! - 7!/2!) - (7! - 6!/2!) 
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81. C(8,3), C(8, 3) -(8x5). 

83. See Supplement at end of Chapter 5. 

85. (a) C(10,6), 

(b) C(10, 6) + [5 x C(10, 5)] + [4 x C(10, 4)] + C(10, 3). 

87. See Supplement at end of Chapter 5. 

Section 5.3 

1. 7!/3!2!2!. 

3. (a) 3 8 , 

(b) (8!/3!2!3!)/3 8 . 

5. C(9 + 4- 1,9). 

7. C(ll + 3—1,11), subtract cases where one party has a majority (9 or more) 
C(ll + 3 — 1, 11) — [3 x C(4 + 3- 1,4)]. 

9. (a) C(9 + 3 — 1, 9), 

(b) C(16 + 3 — 1, 16) — (7(5 -h 3 — 1,5). 

11. C(5,2)xC(6 + 2— 1,6). 

13. 6!/3!2!l! + 6! /2!2! 1 ! 1 !. 

15. C(10, 6) x {[C(6, 2) x 8!/2!2!l ! 4 ] + [6 x 8!/3!l ! 5 ]}. 

17. Consider the cases of: (i) 4 of one letter and 1 of another letter, (ii) 3 of one kind and 
2 of another letter, (iii) 3 of one letter and 1 of two others, (iv) 2 of two letters and 1 
of another, and (v) 2 of one letter and 1 of three others — (2 x3x5!/4!l!) + (2x 
2 x 5!/3!2!) + [2 x C(3, 2) x 5!/3!l!l!] + [C(3, 2) x 2 x 5!/2!2! 1 !] + (3 x 5!/ 
2!1! 3 ). 

19. 3 x 10! /2 !4! . 

21. 10!/2!2! — 9!/2!2!. 

23. (a) 10!/4!3!2!, 

(b) 2 x 10!/4!3!2! — 9!/4!2!2!. 

25. C(8 + 3 - 1, 8) - C(5 + 3 - 1, 5). 

27. See Supplement at end of Chapter 5. 

29. Counts all (1, 2, 3)-sequences of length 10 two ways: left side looks at all cases of 
k\ Is, ki 2s, and £2 3s, while right side counts all (unrestricted) 10-digit sequences 
of Is, 2s, 3s. 

31. See Supplement at end of Chapter 5. 

33. See Supplement at end of Chapter 5. 

35. Must have b or d after each c (except possibly last c); sum is over number of bs 
following cs (first sum when last c followed by b or d, second sum when last c is at 
end of sequence: "£ C(3, it)21!/7!3!(8 - k)\[6- (3 -*)]! + £ C(2, £)20!/7!2! 
(8 — *)![ 6 — (2 — *)]!. 

Section 5.4 

1. (a) C(27 + 3- 1,27), 

(b) 1, 

(c) C(24 + 3- 1,24). 

3. (a) 13!/4!4!3!2!/C(52, 13), 

(b) [13!/4!4!2!3! x 39 !/9 !9 ! 1 1 ! 10 !]/(52 !/ 1 3 ! 4 ), 
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(c) 4 x C(48, 9)/C(52, 13), 

(d) 4! x (13!/4!3! 3 ) 4 /(52!/13!) 4 . 

5. C(10 + 4— 1, 10) x C(4 + 4 — 1, 4) x C(6 + 4- 1,6). 

7. C[(5 — 2) + 4 — 1, (5 — 2)] x 5!. 

9. (a) C(9, 4) x 8!/2! x 5!/2!2!, 

(b) C(9,4) x 8!/2!. 

11. C[(14-3) + (5- 1), (14 — 3)]/C(18, 14). 

13. C(15 + 3—1, 15). 

15. (a) [52 !/ 1 3 ! 4 ]/4 !, 

(b) [52!/8! 3 7! 4 ]/3! x 4!. 

17. 1 - (10! - 10 x 9 x 10!/2!)/10 10 . 

19. (a) 4 3 x C(9 + 4— 1, 9), 

(b) P(4, 3) x C(9 + 4 — 1,9), 

(c) Distribute teddies and fill out each child with lollipops: 4 3 . 

21. 4 x C(4 + 4 — 1, 4). 

23. 2xC(7 + 4— 1,7)— 1. 

25. C[(7 — 3) + 5 — 1, (7-3)] x 7!/4!2!l!. 

27. See Supplement of Selected Solutions at end of Chapter 5. 

29. [10!/2! 5 ]/5 10 . 

31. (a) Distributions of 8 distinct items into 3 boxes, 

(b) Distributions of 9 distinct items into 3 boxes with two items in the first box, 
etc. 

33. (a) Distributions of 6 identical objects into 31 boxes, E/Hi x, = 6, 

(b) Distributions of 5 identical objects into 3 boxes with at most 4 objects in first 
box, etc., Ef=i x i = 5, x\ < 5, X2 < 4, X3 < 2. 

35. C(30 + 3 - 1 , 30), 3 x C[(30 - 16) + 3 - 1 , (30 - 16)]. 

37. (a) C(7 + 4- 1,7), 

(b) C(7 + 5-l,7)+l 

(c) [C(13 + 4—1, 13)] — [4 x C(3 + 4 — 1)]. 

39. Tl=t>C{k + 2- l,k) x C[(12 — 2k) + 2—1, (12 — 2k)]. 

41. ELo C(7 + 3 - 1 , 7) x C [(20 - k) + 4 - 1 , (20 - k )} . 

43. (a) 5 x 6 x 8, (b) (5 x 6 x 8) - 2. 

45. 4! x C(4 + 5- 1,4). 

47. See Supplement at end of Chapter 5. 

49. C(3 + 4— 1, 3) x 5!/3! 

51. (a) 5! x C(9, 5), (b) 21! x 5! x C(22, 5). 

53. Elio C(13, k ) x C(39, 13 - k) x C(26 + k, k) x C(26, 13). 

55. E 15!/a!i>!c! summing over all a, b, c 3-tuples where a + b + c = 15 with no 
letter greater than 7. 

57. (3! x 25!/16!8!l!) + (3 x 25!/14!7!4! x2 4 ) + {3x [25!/12!6!7! x(2 7 -2)- 
25!/6!6!l !]} + 25!/10!5! 3 . 

59. See Supplement at end of Chapter 5. 

61. See Supplement at end of Chapter 5. 

63. C[(n — 2m) + (2m + 2) — 1 ,(n — 2m)] {simplifies to C(n + 1, 2m + 1). 

65. ELo C(k + m-l,k)x C[(r -k) + (n - m) - 1, (r - it)]. 
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Section 5.5 

11. (b) [C(n + 1, 2)] + [6 x C(n + 1, 3)] + [6 x C(n + 1, 4)]. 
13. (a) 2" + (n x 2 n ~ l \ (b) 1.5 x 2". 

21. \ x C(2n + 2, n + 1) - C(2n, n). 

23. 6. 


CHAPTER SIX SOLUTIONS 
Section 6.1 

1. (a) 7 products — xxxx, x 3 llx, x 3 lxl, x 3 xll, lx 3 lx, lx 3 xl, xx 3 ll, 

(b) 5 products — lx 4 , xx 3 , x 2 x 2 , x 3 x, x 4 l, 

(c) 7 products — x 4 lll, lx 4 ll, x 3 llx, x 3 lxl, lx 3 lx, lx 3 xl, llx 2 x 2 , 

(d) 15 products — x 4 ll, x 3 xl, x 3 lx, x 2 x 2 l, x 2 xx, x 2 lx 2 , xx 3 l, xx 2 x, xxx 2 , 
xlx 3 , lx 4 l, lx 3 x, lx 2 x 2 , lxx 3 , llx 4 . 

3. (a) (1 +x + x 2 + x 3 +x 4 +x 5 ) 2 (l +x + x 2 + x 3 + x 4 ), 

(b) (x + x 2 + x 3 + x 4 + x 5 ) 2 (x + x 2 H h x 7 + x 8 ), 

(c) (l+x + x 2 + ---)\ 

(d) (x + x 3 + x 5 H ) 2 (1 + x + x 2 H ) 4 . 

5. (l+x + x 2 H ) 2 (1 +x) 2 , coef. of x 5 . 

7. (x 3 + x 4 + x 5 + ■ • -) 4 , coefficient of x 16 . 

9. (1+x + x 2 + •••)". 

11 . (1 + x 2 + x 4 + • ■ -)(x +x 3 +x 5 H — -)( 1 + x + x 2 H — 

13. (x + x 2 + x 3 + x 4 +x 5 +x 6 ) ,! . 

15. (x~ 3 + x~ 2 + x _1 + 1 + x + x 2 + x 3 ) 4 . 

17. (l+x 5 +x 10 + ---) 8 . 

19. (1 + x + x 2 H )(x + x 2 + x 3 H ) 4 (1 +x + x 2 H ), coefficient of x 15 , gen- 

erally x" -5 . 

21. Cannot have a variable number of factors. 

23. (l+x + x 2 + ---)(l+x 5 +x 10 + ---)(l+x 10 + x 20 + ---). 

25. (1 + x + x 2 H ) 5 (1 + y + y 2 + y 3 ) 5 . 

27. (xy + xz + yz) 8 - 
29. C(p, n), n\. 


Section 6.2 

1. C(10 + /1— 1, 10). 

3. C(m, 7) + C(m. 5) + C(m, 3). 

5. C(ll + 3—1, 11) — C(3, 1) x C(5 + 3—1, 5). 

7. C(1 1 + 7— 1,11) — 7x C(6 + 7 —1,6) + C(7, 2) x C(1 +7- 1, 1). 
9. C(12 + 4— 1, 12)- 4 x C(5 + 4- 1,5). 

11. (a) C(9+ 10- 1,9), 

(b) C(9 + 4— 1, 9) — 3 x C(10 + 4— 1. 10), 
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(c) 0, 

(d) C(10 + 2— 1, 10) + 3 x C(1 1 + 2 — 1, 11), 

(e) b m x C(ll, m). 

13. 0. 


15. (a) 0, 


(b) 1, 

(c) C(12 + 8 — 1, 12), 

(d) 4 12 x C(12 + 5- 1, 12), 

(e) C(4 + 4- 1,4). 

17. (a) (x 2 +x 3 +x 4 + . . ,) 3 , C((10 — 6) + 3 — 1, (10 — 6)), 

(b) (l+x + x 2 )(l+x + x 2 + ...) 2 , C(10+ 3 — 1, 10)-C(7 + 3- 1,7), 

(c) (l+x 2 +x 4 + ...)(l+x + x 2 + ...) 2 , £C((10-2ifc) + 2- 1,(10- 2k)). 
19. C(12 + 5 — 1, 12) — 5 x C(7 + 5 — 1,7) + C(5, 2) x C(2 + 5 — 1,2). 

21. (a) C(15 + n— 1, 15) + C(10 + « - 1, 10), 

(b) C(n, 15 ) + C(n, 10). 

23. C(8 + 7 — 1, 8) — 7 x C(3 + 7 — 1, 3). 

25. C(6 + 3 — 1, 6) — C(3 + 3 — 1, 3) — C(2 + 3 — 1,2). 

27. C(14+10-l, 14)-4x C(10+ 10- 1, 10) - 6 x C(7 + 10 - 1, 7) + 

C( 4, 2)xC(6+10— 1,6) + 4x6xC(3+10— 1,3) — 4xC(2+10— 1,2) + 
C( 6, 2). 

29. [C(10 + 4— 1, 10) — 4 x C(4 + 4 — 1,4)] x [C(15 + 4—1, 15)-4xC(9 + 
4-1,9)} + C( 4, 2) x C(3 + 4 - 1, 3)]. 

33. (a) C(jn +n,m + r), 

(b) 0, if r odd, C{n, r/2), r even, 

(c) 3". 

35. (b) 2. 


39. (b) n/2, pn, 10, m/p. 


41- PAt) = ( - 


l\ m fl-(t/2Y +1 
l-(f/2) 


m 


Section 6.3 


1. (a) 5 partitions — 4, 3+1, 2+1 + 1, 2 + 2, 

(b) 1 1 partitions — 6, 5+1, 4 + 2, 4+1 + 1, 3 + 3, 3 + 2+1, 
2 + 2, 2 + 2+1 + 1.2+1 + 1+1+1, l + l + l + l + l 
3. (1 + x + x 2 + x 3 )(l +x 2 +x 4 +x 6 )(l +x 3 + x 6 + x 9 ) 


5. (1 -x)-*(l -x 5 )” 1 !! -x 10 )-'(l -x 25 )" 1 . 


1 -x 2 

1 — X 

1 


1 


1-x^ 


l-x c 

1 — X- 


1,2 + 


1 -x> 
1 -x 4 


7. (b) (l+x)(l+x 2 )(l+x 3 )...= 

= (after canceling) — . 

(1 — x)(l — — X 3 ) . . . 

19. (a) Multiply the partition generating function (given just before Example 1 ) times 
1 


1 — x ’ 

(b) (x 3 +x 6 )/(l ■ 

(c) Same as (b). 


-x 3 )(l-x 4 )(l-x 6 ), 
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21. (a) Let the number of dots forming the first row and first column (= 2k — 1 if 
first row and column have length k ) in a self-conjugate Ferrers diagram be 
the length of the first row in the distinct, odd-parts diagram; delete the first 
row and column of the self-conjugate diagram and use the number of dots in 
the reduced self-conjugate diagram to define the second row of the distinct, 
odd-parts diagram, etc. 

Section 6.4 



3. (l+x) 5 e 2bc . 

( 2 3 4 \ 13 

1+xH 1 1 | , coefficient of jc I3 / 13 !. 

2! 3! 4!/ ' 

7. (a) 1/2(3' -I- 1), 

(b) l/4[3 r +2 + (-l) r ], 

(c) 3' — 2x2' +1. 

9. (a) 22 10 + 4 x P(10, 1) x 22 9 + 6 x P(10, 2) x 22 s + 4 x P(10, 3) x 22 7 + 
P(10, 4) x 22 6 , 

(b) 26 10 - 4 x 25 10 + 6 x 24 10 - 4 x 23 10 + 22 10 . 


11 . U r . 


13. (a) e x -2x e (n - 1)x/n + e°'- 2 )x/ n , 
(b) [n r - 2 (n - I)'' + (n - 2 ) r ]/n r . 
15. (a) (e x — 1)/jc, 

(b) (1 — jc)- 1 . 

21. e^-b. 

Section 6.5 

1. (a) x/(l -x) 2 , 

(b) 13/(1 -x), 

(c) 3x(l + x)/(l — x) 3 , 

(d) [3x/(l — x) 2 ] + [7/(1 — x)], 

(e) 4 !x 4 /(1 — x) 5 . 

3. x(3 — x)/(l -x) 3 . 

5. (a) (4x 2 — 3x + 1)/(1 — x) 3 , 

(b) log ^(l x). 


CHAPTER SEVEN SOLUTIONS 

Section 7.1 

1. a n = 5a n -\ , a\ — 5. 

3. (Xf\ — — i “f“ &n— 2* 

5. Cl n — dfi—l + “I - d n —\ o “1“ dfi — 25 • 

7 * d u — | | — 2 * 
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9. O n — Cln— l + O n —2 m 

XI. On — On — 1 + On — 2* 

13. (a) a n = a n -\ + n,n > k, initial condition: ak = k+ 1, 

(b) 43. 

15. a n = 1.06(a„_i +50). 

17. On — 2o n — i + 2 On — 2 + 2on—3- 
19. (a) O n — O n — 1 + O n — 3 + O n —5i 

(b) O n — 1 + tl n — 3 + tin— 5 t2 n —6i 

(c) — O n — i + On— 3 + tin— 5 tl n —9' 

21. + 3" _1 . 

23. a n = 2a n -i + 4 W_1 . 

25. — 3nf w _i o n — 3 • 

27. = (2n — l)<z n _i. 

29. = O n ,k— 1 + 1,&— 1 + tin— 2, k— l + ^/z— 3,&— 1* 

31. 1 ,A: — 1 + tln—k,k • 

33. — b n —\ + C/2—i , b n — c n — 2 b n —\ ^*/i — 1 • 

35. = such sequences starting with a 0, = such sequences starting with a 1 , 

c n = h - digit binary sequences with no consecutive Os, a n = b n ~\ + c n _3, b n = 

tin— 1 + hn—\t Cn = Cn—1 + £«— 2* 

37. p n = ways to hand out a penny, nickel, or dime on successive days with a penny 
on the first day, n n and d n are defined similary, p n = n n ~\ + d n -\, n n = p n - 5 
+ d n —5, d n — /7 w _io + ^/z— 10* 

39. 22 w — CL n _\Cl3 + U n _2Cl4 + ■ * * + . 

41. (a) 3,0, 

(b) 2n- 1,2, 

(c) 3 n 2 — 3/2 + 1, 6« — 6. 

43. (a) a„ = 2a n -i, 

(b) a n = r~\ 

(c) The first k integers in the sequence will form a set of consecutive integers, 
the ( k + l)-st integer can be the next larger or the next smaller number to 
extend this consecutive set. 

45. a n =a n - u n not a multiple of 5, a„ = a„-i + 1, // a multiple of 5 but not 10 or 
25, a„ = a„_i + 2, n a multiple of 10 or 25. 

n + 1 

47. a n = a„_3, n even, a„ = a„_ 3 + — - — . 

Section 7.2 

1. (a) A/i -5, (b) Ah 1/2 + 2h, 

(d) An - 2, (e) a n = An 4 - 5n/l, 

3. (a) a n = a n /io +l,a n = log 10 n, 

(b) a„ = 10fl„/io + 1, a n = \n-\. 

5. a n = 2a n /2 + n — 1 , a„ — n log 2 n — n + 1 . 

9. (a) Pick largest from first half and largest from second half and compare, 


(c) A + 4n - Llog 2 (n)J, 
(f) a„ = An 2 — 3 //. 
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(b) tin — Hu H /2 4~ 1 5 

(c) a n = n — 1 . 


Section 7.3 

1. (1.08)" x 1000. 

3. (a) a„ = |4 n + |( — 1)", (b) a n = 1, 

(c) a n = 2, (d) a„ = \n 2 — ^n+ 1. 

2 _ j _ ^2 2 a /2 

5. a n = 2 a,,-! + a n — 2 , a n = — ^-(1 + V2) n + — ^-(1 - V2)' ! . 

7. p„ - /?„_! = 2(/?„_i - /?„_ 2 ), Pn — (3 X 2") - 2. 

11. ci = 9, c 2 = —18. 

Section 7.4 

1. (a) a n = 3C(n,2) + 1, 

(b) = 2 C(n + 1, 3) + 3, 

(c) a n = 6C(n + 2, 3) - 3 C(n + 1, 2) + 10. 

3. a n = a„_ i + 2C(n, 2) + n,a„ = 2 C(n + 1, 3) + C(n + 1, 2). 

5. a„ — a„- 1 + Y^'IcZilk x (n — 2 — k) + 1 } , a„ — C(n , 4) + C(«, 2) — n + 1. 
7. a„ = 1250(1.04)" - 1250. 

9. (a) —3" + 1, (b) §2" + f(-l)", 

(c) (3 x 2") - n - 2, (d) (15 x 2") - 2 n 2 -8 n- 12. 

11. 3 x 2" - 3n - 2. 

13. a„ = 2a„_i — a„_ 2 + (10 x 2 k ), a„ = 960« — 20 + (40 x 2"). 

17. Ax2" + 5x2" + |«+f. 

19. (a) a n = V2" +1 - 1, 

(b) a n =n\, n even, a„ — 0, n odd. 


Section 7.5 


2x 

1. (a) g(x ) - 1 = xg(x ) + , 

1 —x 

2x 2 

(b) g(x) - x - 1 = 3 xg(x) - 3 - 2x 2 g(x) + — — , 

1 — x 

2x 2 

(c) g(x) - 1 = xg(x) + — - — — , 

( 1 - xy 
2x 

(d) g(x) - 1 = 2xg(x) + ~ . 

1 — 2x 

1 

7. a n = cha n - 1 + fl 3 fl„_ 2 + • — h a„_ifl 2 , n > 3, a 2 = 1, a n — C(2n — 4, 

n — 1 


n - 2). 

9. a„ = aia n -i + a 2 a„_i H — • + n >2,a\ = l, a n = —C{2n — 2, n — 1). 

n 

11. a n<k = pa n -\ tk - 1 + qa n -i' k , F n (x) = (q + px) n . 
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13. Similar to recurrence relations in Example 5 except with 4" 1 replacing 3" 
and a\ = 0 instead of a\ = 1 (still b\= c\ = 1), a n — -4(4" — 1), n even, 
A(4«_4) ; „ odd. 

15. (a) a n = J2 C{n - 1, k - 1 )a n - k , 

(b) g(x) = e e ‘~ 1 . 


CHAPTER EIGHT SOLUTIONS 
Section 8.1 

1. 26 s - 21 2 x 26 6 . 

^ ^ n 3 x 2 n_1 

5. {C(52, 7) - C(13, 7) x C(4, 1) 7 )/C(52, 7). 

7. 5 x (9 4 - 9 x 8 x C( 4, 3). 

9. 700-200- 180- 150. 

11. (a) 300-70- 100 + 40. 

(b) 300- 100-60. 

13. (142 + 500 — 71)/ 1000. 

15. (a) 200 - 3 x 85 + 3 x 30 - 15, 

(b) 85-2 x 30+ 15. 

17. 30-15-10-6 + 5 + 3 + 2-1. 

19. 3 20 — 3 x 2 20 + 3 x 1. 

21. 6 !/2 ! 3 — 3 x 5!/2! 2 + 3 x4!/2! — 3!. 

23. C(37, 10) - [C(27, 10) + C(25, 10) + C(22, 10)] + [C(15, 10) 
+ C(12, 10)+ 1], 

25. C(52, 6) - 3 x C(48, 6) + 3 x C(44, 6) - C(40, 6). 

27. 3 x C( 6, 2) x 4! - {2 x C( 6, 3) x 3! + 6!/2! 2 } + C( 6, 4) x 2!. 
29. (15I/3! 5 — 3 x 5! x 101/2 ! 5 + 3 x 5 ! 3 — 5 ! 3 )/(15 !/3 ! 5 ). 

31. 20. 

33. N(Y) = N(Y -K) + N(Y (1 K) = 50 + 20. 

35. 25, 5. 


Section 8.2 

1. 10'" — 3 x 9"' + 3 x 8'" — 7'” . 

3. 13 x C(48, 5) — C(13, 2) x 44. 

5. (a) {C(52, 13) — C(4, l)x C(39, 13) + C(4, 2) x C(26, 13)-C(4, 3) 
x C(13, 13)}/C(52, 13), 

(b) {C( 4, 1) x C(39, 13) - C( 4, 2) x C(26, 13) + C( 4, 3) 
x C(13, 13)}/C(52, 13), 

(c) {C(52, 13) - C( 4, 1) x C(48, 13) + C( 4, 2) x C(44, 13) - C( 4, 3) 
x C(40, 13) + C(36, 13))/C(52, 13). 

7. 91/3! 3 — 3 x 71/3! 2 + 3 x 51/3! — 31. 

9 . 26! — {3 x 23 ! + 24 1 } + { 2 x 20 1 + 2x211} — 18!. 
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11 . C(26 + 6 — 1, 26) — 3 x C(19 + 6- 1, 19) + 3 x C(12 + 6- 1, 12) 

— C(5 + 6—1,5). 

13 . C(10 + 4—1, 10) — C(4, 1) x C(8 + 3 - 1, 8) + C(4, 2) x 7 - C(4, 3) x 1 . 

15 . 27. 

17 . «sl0! 2 /e. 

19 . 10!/2! 5 x { 10!/2! 5 - C(5, 1) x 8!/2! 4 + C(5, 2) x 6!/2! 3 - C(5, 3) x 4!/2! 2 
+ C(5, 4) - C(5, 5)}. 

21 . 15!/3! 5 - 5 x 5 x 12!/3! 4 + C(5, 2) x 5 x 4 x 9!/3! 3 - C(5, 3) x 5 x 4 x 3 
x 6!/3! 2 + 5 x 5! — 5!. 

23 . (a) n 5 - C(5, l)xn 4 + C(5, 2) x n 3 - C( 5, 3) x n 2 + {C(5, 4) - C( 5, 5)} x re, 

(b) n 5 — C(5, l)xn 4 + C(5, 2) x n 3 - {(C(5, 3) - 1 ) x n 2 + n 3 } 

+ {(C(5, 4) - 2) x n + 2 x n 2 } - n. 

(c) n 5 - C( 7, 1) x n 4 + C(7, 2) x n 3 - {C(7, 3) - 3) x n 2 + 3 x n 3 } 

+ {(C(7, 4) - 14) x n + 14 x « 2 } - {(C(7, 5) - 2) x n + 2 x n 2 } + {C(7, 6) 
-C(7,7))xn. 

27 . If 21 = C(2 + 6- 1, 2), then E(-l)* x C(6, k) x (21 -*)". 

29 . 5!" x {(2n — 1)! — E C(n, A:) x (2n — 1 — £)!}. 

31 . P(C(7, 3), 7) - 7 x P(C( 6, 3), 7) + C( 7, 2) x P(C(5, 3), 7). 

33 . C(P(6, 3), 8) - 6 x C(P(5, 3), 8) + C(6, 2) x C(P(4, 3), 8). 

35 . { E(-l)* x C(n,k)x(n-k) r j/n!. 

37. IX3 (-1)*+' xC(k,?>)xC{n,k)x(ji-k) r , T,kZl(~ l ) k+1 xC(A:- 
x C(«, A;) x (« — A:) r . 

39 . [C(5, 2) x 9!/2! 3 ] - [2 x C(5, 3) x 8!/2! 2 ] + [3 x C(5, 4) x 7!/2!] - 

47. EZ=o* x { x C(j, k) x «!/;!}. 

Section 8.3 

1. 5 x 5 board with darkened squares on main diagonal. 

3 . 5! — 8x41 + 20x3! — 16x2! + 4x 1!. 

5 . 7! - (9 x 6!) + (30 x 5!) - (46 x 4!) + (32 x 3!) - (8 x 21). 

7 . 51 — (7x4!) + (16x3!) — (13x2!) + (2 x 1!). 

9 . 3. 

11 . (a) 4 x 5 board with darkened squares in 4 positions just to right of main diagonal, 

(b) (x + l) 4 , 

(c) E x C 0’ k) x C(n — 1, j) x (n - ;)!. 

13. 2x2 array of darkened squares and “L” (a column of 3 squares beside a single 
square both have 1 + 4.v + 2x 2 ). 


CHAPTER NINE SOLUTIONS 
Section 9.1 

1. (a) Not symmetric, 

(b) Yes, 

(c) Not transitive, 


1 , 2 ) 

[4x6!]. 
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(d) Not transitive, 

(e) Not transitive. 

3. (a) 6 symmetries (as in Example 3), 

(b) 4 symmetries, 

(c) 1 symmetry. 

5. (a) All Cj left fixed, 

( CiC2C3C4CsC(,CTC^CgCioCnCi2CuCuClsCi6\ 

\CiC3C4C5C2C2C^CgC6CuCioCi3Ci4Ci5Ci2Ci6j 

( CiC2C3C4C5C6C7C8C9CioCnCi2Ci3Ci4Ci5Ci6\ 
yCiC3C2C5C4C6C9C8C 7 CnCioCi5Ci4Ci3Ci2Ci6 y / 

fdl ( ^lC2C3C4C5C6C7C8C9CioCnCi2Ci3Ci4Ci5Ci6\ 
VC1C2C5C4C3C9C8C7C6C10CHC14C13C12C15C167 

11. (a) a, b, c are rotations or 0°, 120°, and 240°, respectively d , e, f are flips around 
vertical axis, axis 30° clockwise of vertical, and axis 30° counterclockwise 
of vertical; row is first symmetry, column second symmetry: 


a b c d e f 


a 

b 


c 

d 


e 

f 


a b c 
b c a 
cab 
d e f 
e f d 
f d e 


d e f 
f d e 
e f d 
a b c 
cab 
b c a 


(b) Straightforward. 


T /I 2 3 4\ I ( 1 2 3 4\ /I 2 3 4\ j _ /I 2 3 4\ 

(c) Let a — (j 2 3 J, b — ( 2 1 4 3 ), c — ( 3 4 , 2 ), d — ( 4 3 2 J, 


abed 


a 

b 


c 

d 


abed 
bade 
c d a b 
d c b a 


13. Only left structure (right structure has 6 isomers). 

21. (b) {7Ti , 7T3, TTs, JTsl or {jt\, 7 T 3 , TtT, Ttg}, 

(c) In addition to subgroups in (b) and G, G' , G", other subgroups are { tv t it, j , 
for i = 3, 5, 6, 8. 


Section 9.2 

1. (a) 24, (b) 70. 

3. j[3 15 + (2 x 3 5 )]. 

5. 51. 
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7. i(5" + 5" /2 ) n even, and ±[5" + (3 x 5 ( "- 1)/2 )] n even. 

9. (a) In cycle form: ttj = (1)(2)(3), n 2 = (12)(3), tt 3 = (13)(2), 
tt 4 = (1X23), tt 5 = (123), jt 6 = (132), 

(b) i|i(3Ti) = C(12 + 3 - 1, 12) = 91, i|i(7 r 2 ) = «|»fo) = iKjt 4 ) = 7, ^(7r 5 ) 

= ^(tt 6 ) = 1, answer: i (91 +7 + 7 + 7+ 1 + 1) =19. 

11. 4»(jti) = 18, 1^(773) = 6, 1^(777) = ^(jig) = 12, and other (Jj(7r,)' = 0, answer 

g(18 + 6+ 12 + 12) = 6; 

13. (b) {7T ls 7T 7 }, (c) {7T! , 7T 6 }. 


Section 9.3 

1. 55. 

3. (a) 130, 

(b) 92, 

(c) Cyclic color sequence on hexagon of R-W-B-R-W-W and R-B-W-R-W-W. 
5. (a) |(m 4 + 2m 2 + 3m 3 ), 

(b) g(m 12 + 2m 3 + 3m 6 + 2m 7 ), 

(c) \(m 5 +m 3 ), 

(d) ±(m 8 + 3m 4 ), 

(e) + 2 m 2 + 2m 3 + 4m 4 + 3m 5 ), 

(f) Same as (b). 

7. (a) g(m 6 + 2m 2 + 3m 4 ), 

(b) g(m 12 + 2m 3 + 3m 6 + 2m 7 ), 

(c) i(m 6 + m 4 ), 

(d) |(m 10 + m 5 + m 6 + m 7 ), 

(e) 4,(m 12 + 2m 2 + 2m 4 + m 6 + 6m 7 ), 

(f) |(m 16 + 2m 4 + m 8 + 4m 9 ). 

9. | [7 4 + (3 x 7 2 )] = 637. 

11. (a) |(2 8 + 2 4 ) = 136, 

(b) |(2 8 + 2 4 + 0 + 2 4 ) = 72. 

13. 41 ( 71 , ) = number of cycles of length 1 . 

15. (a) i[m p + (p — 1) x m], 

(b) j l -{m p + [(/? - 1) x m] + (p x m ip+1)/2 )}. 


Section 9.4 

1. b 5 + fe 4 w + 2fe 3 w 2 + 2b 1 w 2 + few 4 + w 5 . 

3. ft 4 + w 4 + r 4 + fe 3 w + fe 3 r + few 3 + w 3 r + fer 3 + wr 3 + 2fe 2 w 2 + 2fe 2 r 2 
+ 2w 2 r 2 + 2b 2 wr + 2 bw 2 r + 2 fewr 2 . 

5. (a) g{(fe + w) 4 + 2(fe 3 + w 3 )(fe + w) + 3(fe 2 + w 2 )(fe + w) 2 }, 

(b) |{(fe + w) 12 + 2(fe 4 + w 4 ) 3 + 3(fe 2 + w 2 ) 6 + 2(fe 2 + w 2 ) 5 (fe + w) 2 }, 

(c) ±{(fe + w) 5 + (fe + w)(fe 2 + w 2 ) 2 }. 
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(d) \{(b + w) s + 3(b 2 +w 2 ) 4 }, 

(e) ±{(b + w) 7 + 2 (b 6 + w 6 )(b + w) + 2 (b 3 + w 3 ) 2 (b + w) + 4 (b 2 + w 2 ) 3 
(b + w) + 3(b 2 + w 2 ) 2 (b + w) 3 }, 

(f) Same as (b). 

7. (a) l{(b + w) 6 + 2(b 2 + w 2 ) 2 + 3(b 2 + w 2 ) 2 (b + w) 2 }, 

(b) l{(b + w) 12 + 2 (b 4 + w 4 ) 3 + 3 (b 2 + w 2 ) 6 + 2 (b 2 + w 2 )\b + w) 2 }, 

(c) \{(b + w) 6 + (b + w) 2 (b 2 + w 2 ) 2 }, 

(d) \{{b + w) 10 + (fc 2 + w 2 ) 5 + (b + w) 2 {b 2 + w 2 ) 4 + ( b + h-) 4 (& 2 + w 2 ) 3 }, 

(e) ±{(b + w) 12 + 2(b 6 + w 6 ) 2 + 2{b 2 + w 3 ) 4 + (fe 2 + w 2 ) 6 + bib 2 + w- 2 ) 5 
ib + w) 2 }, 

(f) |{(& + w) 16 + 2(b 4 + w 4 ) 4 + ( b 2 + w 2 f + 4(6 2 + w 2 ) 7 (& + w) 2 }. 

9. (a) i{(fc + w) 4 + 8(fe 3 + w 3 )(& + w) + 3(/7 2 + w 2 ) 2 }, 

(b) ±{{b + w) 6 + bib 4 + w 4 )(fe + w) 2 + 3ib 2 + w 2 ) 2 ib + w) 2 + bib 2 + w 2 ) 3 
8(/? 3 -f- vi ;3 ) 2 }. 

11. ±{ib + w) 4 + bib 4 + b 4 ) + 8 ib 3 + w 2 )ib + w) + 3 ib 2 + w 2 ) 2 + bib 2 + w 2 ) 
ib + w) 2 }. 

13. (a) If not a cyclic rotation of all comers, the length of the cycle would have to 
divide p — impossible, 

(b) dp, k)/p. 

15. (a) 36, (b) 216. 


CHAPTER TEN SOLUTIONS 
Section 10.1 

1. (a) {a, c} or { b , d], 

(b) /, 

(c) No kernel, consider directed 5-circuit b, a, d, g, h , b, a — if a is K (kernel), 
then d not in K , then g in K, then h not in K , then b in K — impossible since 
a in K ; similar sort of argument (also involving c, /, e) if a not in K . 

3. {3,4,9, 11, 12, 16, 17,21,25,26, 27, 31,32, 36, over 40}. 

5. A goes to 2, B must go to 4 or else A will win. 

7. Move to multiples of 5k + 1 (initial position is win for second player). 

9. By symmetry assume gia) = 0, then gie) = 1, then gid) = 0, then g(c) = 1, then 
gib) = 0, but now two kernel vertices are adjacent. 

11. S is a kernel if and only if all vertices not in S have an edge to a vertex in 
S while no vertex in S has an edge to a vertex in S — that is, if and only if 

WiS) = s. 

13. Follows from parts (a) and (b) of Exercise 12 since gix) = k means there is a path 
of length k starting at x while /(x) is length of longest path starting at x; longest 
path is at least length k (maybe there is another longer path starting at x). 
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15. Suppose x and y are adjacent because there is an edge from x to y ; then x 
must have a larger level number than y and a different Grundy value from its 
successor y. 

17. If there were an infinite number of vertices, then one of the finite number of 
starting vertices, call it q, must have an infinite number of vertices reachable 
from it, and one of the finite number of successors of q, call it q, must have 
an infinite number of vertices reachable from it, and one of the finite number 
of successors of q, call it q, must have an infinite number of vertices reachable 
from it, and so on, without end. 

19. (a) Let a = 0000, b = 000, c = 0_00, d = 00_0, e = 00, / = 0_0, g = 0, 
h =_(win); s{a) = {b. c, d, e, /}, s(b) = {e, /, g}, s(c) = s(d) = {e, /, g}, 
s(e)={g,h},s(f) = {g},.s(g) = h,s(h ) = 0;g(/) = g(h) = 0 ,g(a) = 
gig) = 1, g(e) = 2, g{b) = g(c) = g(d) = 3. 


Section 10.2 

1. (a) 7, remove 3 from 4th pile, 

(b) 0, 

(c) 4, remove 4 from 2nd, 3rd, or 4th pile, 

(d) 0. 

3. (a) 3, remove 3 from 3rd pile or 2 from 4th pile, 

(b) 2, remove 2 from 3rd or 4th pile, 

(c) 0, 

(d) 0. 

5. (a) 0, 

(b) 0, 

(c) 1, remove 1 from 4th pile, 

(d) 3, remove 1 from 3rd pile. 

7. (a) 3, add nickel to 3rd pile, 

(b) (0, 0), (0, 4), (0, 6), (0, 9), (1, 1), (2, 2), (2, 5), (2, 8), (3, 3), (3, 7), (4, 4), 
(4, 6), (5, 5). 

9. If cj = dj , then trivially d 4- Cj =c' + dj ; if c' + Cj = c' + dj , then cj and dj 
must have Is in the same positions in their binary representations — that is, they 
must be equal. 

11. Immediately the proposed strategy works. 

13. (a) Remove three balls along one of the three lines formed by ball on one of the 
three sides of the arrangement. 

POSTLUDE SOLUTIONS 

1. THEORIES 
3. COMPLETE 
5. FAMILY 
7. UNIFORM 
9. VERTICES 
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APPENDIX SOLUTIONS 

Section A.l 

1. (a) 12, 27 

(b) 2, 3, 6, 7, 9, 12, 15, 17, 18, 21, 22, 24, 27, 

(c) 1, 4, 5, 8, 10, 11, 13, 14, 16, 19, 20, 23, 25, 26, 28, 29, 

(d) all 1 < A: < 29 except 12, 27. 

3. We are given N(R(1 M) = 2 as well as that N(M) — N(R) = N(M) = 
N(R) — 4; then N(R n M) = N(M) - N(RD M) = 4 - 2 = 2, N(RH M) = 
N(R) - N(R n M) = 4 - 2 = 2, and clearly flM) = 2. 

5. (a) Impossible, 

(b) Yes, 20 - 8 - 8 = 4, 

(c) 20-15 = 5. 




(c) (A U B) fl (A fl B) = (A fl B), see Figure A1.3, 

(d) A — (B — A) = A; here all expressions involve fl, so that ABC — AC\ B C\ C . 



17. (a) E = S U H U C, 52 3 - 39\ 

(b) £ = (SU//UC)n(Sn//nC),52 3 -39 3 - 13\ 

(c) e = (S n h n C) u (S n h n c) u (5 n h n c), 3 x 13 2 x 39. 


Section A.2 

23. One can prove by induction only a property that is a function of n — for example, 
one can prove that there are a finite number of binary sequences of length <n. 
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25. The initial step only assumes that n > 1, not n > 2, but for n = 1, a n 2 is unde- 
fined. 

Section A.3 

1. 1/2. 

3. (a) 1/6, 

(b) 18/36=1/2, 

(c) 3/36= 1/12. 

5. (a) 1/6, (b) 1/2. 

7. (a) 1/3 x 1/3 = 1/9, (b) 2 x 1/3 x 2/3 = 4/9. 

9. (2 x 2 x 2)/4! = 1/3. 

11. 1 — [(50 + 40 — 20)/ 100] = 3/10. 

13. 2/3. 

15. (a) All sequences with k tails, 0 < k < 8, and one head followed by a head, 

(b) All positive integers, 

(c) All ordered pairs of positive integers, 

(d) All sequences of k black balls, k>0, followed by a red ball. 

Section A. 4 

1. n + 1 . 

15. Printer i is connected to computers i, i + 1, i + 2, i + 3, i + 4, i + 5. 
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AC Principle, 16 
Addition Principle, 180 
Adjacency matrix, 103 
Adjacent vertices, 3 
Adler, 1,316,317 
Agnarsson, G., 439 
Ahuja, R„ 128, 174, 175 
Aldous, J„ 439 
Allenby, R„ 440 
Ancestors in a tree, 94, 438 
Andreescu, T., 440 
Appel, K„ 32, 69, 80, 86 
Arrangement(s), 190, 435 
with repetition, 206 
Assignment problem, 121, 175 
Art gallery problem, 79 
Augmenting flow algorithm, 142 
Ault, L., xiii, xv 
Automorphism of a graph, 48 

B 

Backtracking in a graph, 104, 438 

Balakrishnan, V., 439, 440 

Balanced tree, 96, 438 

Ball, W„ 400 

Barnette, D., 86 

Berge, C., 400, 440 

Berlekamp, E., 400 

Bernoulli, Jacob, 237 

Bernoulli, Jacques, 237, 425 

Best, S„ 439 

Biggs, N., 44, 86, 440 

Binomial coefficient, 190, 228, 435 

Binomial identities, 230, 261, 

291, 329 

Binomial Theorem, 227 


Bipartite graph, 5, 27, 67, 153, 435 
deficiency, 163 
Birthday paradox, 203 
Blockwalking, 230 
Boat crossing puzzles, 108 
Bogart, K„ 439 
Bondy, J„ 44, 87 
Bono, M„ 439 
Boole, G.,418 
Boolean algebra, 416 
Bouton, C., 400 
Boyer, C., 418 

Branch-and-bound search, 113 
Breadth-first search, 104, 438 
Bridge in a graph, 46 
Bridge probabilities, 215 
Brook’s Theorem, 80 
Brualdi, R„ 439 
Bubble sort, 122 
Buck, R„ 198, 237 
Burnside’s Lemma, 365 
Bussey, W„ 421 

C 

C(n,r), 190 

Cameron, P., 281, 439 
Capacity of a cut, 137 
of an edge, 135 
Capobianco, M., 440 
Cardano, B., 425 
Catalan number, 313 
Cayley, A„ 44, 99, 1125 
Center of a tree, 101 
Chain in a network, 142 
Characteristic equation, 300, 310 
Characteristic sequence of a tree, 112 
Chartrand, G., 440 
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Chessboard, generalized, 342 
Children in a tree, 94, 438 
Chromatic number, 69, 80, 435 
Chromatic polynomial, 81, 435 
Chvatal’s theorem, V., 61 
Circle graph, 5 1 
Circle-chord method, 343 
Circuit in a graph, 4, 435 
length of a circuit, 27 
Closure in a group, 359 
Code text, 404 
Coin balancing, 98 
Color critical graph, 79, 85 
Coloring a graph, 69, 334, 433 
Combination(s), 190, 435 
Complement: 

of a chessboard, 351 
of a graph, 17, 85, 435 
of a set, 416 

Complete graph, 15, 25, 435 
Component of a graph, 25, 436 
Computational complexity, 430 
Configuration in a graph, 35 
Conjugate diagram, 268 
Connected graph, 4, 435 
Strongly connected, 55 
Test for connectedness, 104 
Conway, J., 400 
Cook, S. 431 
Cormen, T., 298, 317 
Coxeter, H., 400 
Crossing number, 42 
Cryptogram, 401 
Cube, symmetries of, 379 
Cut in network, 136 
a-z cut, 136 
Cut-set, 47, 151 
Cycle, 49, 435 
Cycle in a permutation, 359 
Cycle index, 371 

Cycle structure representation, 370 

D 

David, F„ 236, 237, 425 
Deadheading edge, 52 


Decomposition principle for Instant 
Insanity, 88 

Deficiency of a bipartite graph, 163 
Degree of a vertex, 6, 24, 
in-degree and out-degree, 18 
Degree of a region, 38 
De Carteblanche, F., 88 
De Moivre, A.,281,316, 351 
De Morgan, A., 389, 417, 421 
Depth-first search, 104, 438 
Derangement, 333, 436 
Descendants in a tree, 94, 438 
Diameter of a graph, 67 
Dictionary search, 5, 97 
Difference equation, 290 
Difference of sets, 416 
Digraph, 40 1 
Digital sum, 395 

Dijkstra’s algorithm for shortest paths, 
127 

Dirac’s theorem, 61 
Direct sum of graphs, 394 
Directed graph, 3, 436 
Dirichlet drawer principle, 427 
Distribution, 215, 289, 436 
Divide-and-conquer relations, 296 
Dual graph, 32 
Durfee square, 270 

E 

Edge, 3, 24 
directed edge, 3, 436 
Edge chromatic number, 80 
Edge coloring, 73 
Edge cover, 8, 155, 436 
Edge coloring, 73 
Elements of a set, 415 
Eliminating a team from contention, 157 
Equivalence relation, 356 
equivalence class, 356 
Erickson, M., 439 
Euler, L„ 37, 44, 49,51,86, 281 
Euler’s constant e, 198, 333 
Euler cycle, 50, 112, 436 
Euler’s formula for graphs, 37 
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Euler trail, 53, 436 
Event of outcomes, 423 
compound event, 424 
elementary event, 423 
Expected value of a random variable, 
265 

Experiment, 423 

F 

Factor in a graph, 89 
labeled factor, 89 
Family of elements, 415 
Fed, T„ 414 
Feller, W„ 281 
Feng, Z., 440 
Fermat, P., 236, 425 
Ferrers diagram, 268 
Fibonacci, 316 

Fibonacci numbers, 281, 285, 302, 310, 
316 

Fibonacci relation, 285 
Fisk’s theorem, 79 

Fleury’s algorithm for Euler cycles, 55 
Flow in network, 157 
dynamic flow, 147 
value of a flow, 137 

Floyd’s algorithm for shortest paths, 129 

Ford, L„ 174, 175 

Forest of trees, 101 

Four-color problem, 32, 69, 80, 86 

Fourier, J„ 281 

Fourier transform, 28 1 

Fulkerson, D., 174, 175 

G 

Gaines, H., 414 
Galileo, 425 
Garbage collection, 72 
Generating functions, 249, 308, 436 
exponential, 272 
Generators of a group, 362 
George, J., 440 
Graham, R., 427, 439 
Graph, 3, 436 
Gray code, 62 


Greenlaw, R., 440 
Grinberg’s theorem, 61 
Gross, J., 440 
Group of symmetries, 360 
Grundy, P„ 390, 400 
Grundy function, 390, 395 
Guy, R„ 400 

H 

Haken, W„ 32, 69, 80, 86 
Hall, M„ 440 

Hall’s Marriage Theorem, 156 
Halmos, P.,418 
Hamilton, W., 86 

Hamiltonian circuit and path, 56, 86, 
119, 436 

Harary, F„ 383, 440 
Harris, J., 440 
Hartsfield, N„ 440 
Heap, 123 
Heap sort, 123 
Height of a tree, 96, 438 
Hillier, F„ 175 
Hopcroft, J., 163 
Hypercube, 63 

I 

Identical Objects Rule, 425 
Identity in a group, 359 
Inclusion-exclusion formula, 328 
Inclusion-exclusion principle, 320 
Independent edges, 153 
Independent set, 8, 69, 436 
Induction, 420 
Initial conditions, 283 
Instant Insanity puzzle, 87 
Integer solutions of an equation, 217, 
250, 332 

Interest problems, 287 

Internal vertex in a tree, 95, 438 

Intersection of sets, 416 

Inverse of a symmetry, 359 

Isolated vertices, 16 

Isomers of organic compounds, 362 

Isomorphism of graphs, 14, 436 
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K 

K n (complete graph on n vertices), 15 

Kan, A. R., 126 

Kayles, 393 

Kernel of game, 387 

Keys in Mastermind, xi 

Keyword, 405 

Keyword transpose encoding, 405 
Kiefer, J„ 316, 317 
Kirchhoff, G., 44, 125 
Knight’s tour, 55, 67 
Knuth, D„ 440 
Konig, D., 44 

Konig-Egevary theorem, 156 
Konigsberg bridges, 49 
Kruse, R., 126 

Kruskal’s algorithm for minimal 
spanning trees, 131 
Kuratowski’s theorem, 35 

L 

Laplace, S„ 281, 423, 425 
Laplace transform, 28 1 
Lawler, E., 126 
Leaves of a tree, 95, 438 
Leibnitz, G., 237 
Leiserson, C., 298, 317 
Lenstra, J., 126 
Lesniak, L., 440 
Level in a game, 388 
Level numbers in a tree, 94, 438 
Letter frequencies, 401 
Lewand, R., 414 
Lieberman, G., 175 
Line graph, 41, 48, 55, 68 
Linear program, 175 
Lloyd, E., 44, 86,440 
Lucas, E., 126 

M 

MacMahon, R, 281 
Magnanti, T„ 175 
Map coloring, 32, 437 
Marcus, D„ 440 
Martin, G., 440 


Mastermind, xi 

Matching in a graph, 4, 153, 437 
maximal, 153 
X-matching, 153 
Matching network, 154 
Maurolycus, 421 
Max flow-min cut theorem, 145 
Maximal planar graph, 42 
Maze searching, 106 
Mazur, D.,439 
Member of a set, 415 
Menage, 350 
Merge sort, 122 
Meriss, R., 440 
Minimal spanning tree, 131 
Minimum cost rule, 174 
Missionaries-cannibals puzzle, 108 
Molluzzo, J., 440 

Moments of a random variable, 265 
fc-th moment, 277 
Montmort, P., 351 
Mountain climber’s puzzle, 25 
Multigraph, 49, 437 
Multiplication, fast, 298 
Multiplication Principle, 180 
Murty, U„ 44, 87, 440 

N 

Network, 127, 437 
Network flow, 135, 437 
Nim game, 393, 400 
Northwest corner rule, 168 
NP-completeness, 57, 69, 113, 431 
Null set, 415 

O 

Ore, O., 440 
Orlin, J„ 175 
O’Rourke, J„ 79, 87 

P 

P (n,r), 190 
Palmer, E„ 383 
Parent in a tree, 94, 438 
Parenthesization, 289, 312 
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Partitions, 266, 437 
of an integer, 266 
self-conjugate, 270 
Pascal, B„ 236, 421,423,425 
Pascal’s triangle, 229, 236 
Patashnik, O., 440 
Path in a graph, 4, 437 
directed path, 10 
length of a path, 27 
Pattern inventory, 353 
Peacock, G.,417 
Permutation, 190, 437 
r -permutation, 190 
Pigeonhole Principle, 427 
Pisa, L., 316 

Pitcher pouring puzzle, 106 
Plain text, 404 

Planar graph, 31, 80, 152, 437 
Plane graph, 3 1 
Platonic graph, 43 
Poker probabilities, 192, 331 
Polya, G., 229, 377, 382, 383, 440 
Polya’s enumeration formula, 377 
Polygon, 78 
Power series, 249 

Prim’s algorithm for minimal spanning 
tree, 131 

Probability of an event, 423 
Probability generating function, 265, 
277 

Progressively finite game, 385 
Prufer sequence, 100 

R 

Ramsey theory, 47, 427 
Random variable, 265 
Range graph, 25 
Range in a matching, 156 
Recurrence relation, 283, 437 
homogeneous relation, 300 
inhomogeneous relation, 304 
systems of recurrence relations, 289 
Reade, R„ 383 
Reflexivity in a relation, 356 
Region, 37 


Regular graph, 45 
Rencontre, 339 
Ringel, G., 440 
Riordan, J„ 351, 440 
Rivest, R., 298, 317 
Roberts, F., 439 
Rook, non-capturing, 342 
Rook polynomial, 343 
Root of a tree, 93, 438 
Rotation of a figure, 356 
Rothschild, B„ 427, 439 
Round-robin tournament, 46, 73 
Rubik’s cube, 68 
Run in a sequence, 222 
Ryser, H., 440 

S 

Sample space, 424 
Sandefur, J„ 290, 317 
Satisfiability problem, 43 1 
Saturated and unsaturated edges, 139 
Scheduling problems, 8, 29, 71, 72 
Schwartz, B„ 157, 175 
Secret code, xi 
Sedgewick, R., 126 
Selection, 190, 437 
Selection with repetition, 207 
equivalent forms, 218, 223 
Self-complementary graph, 48 
Set, 415 

Set Composition Principle, 194 
Set of distinct representatives, 154 
Shapley-Shubik index, 197 
Shih-Chieh, C., 237 
Shmoys, D., 126 
Shortest path algorithms, 127 
Sibling in a tree, 94, 438 
Sink in a network, 135 
Sinkov, A., 414 
Slack in an edge, 139 
Slomson, A., 440 
Sorting algorithms, 121 
Source in a network, 135 
Spanning tree, 104, 131, 167 438 
Spaziergangen problem, 49 
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Spencer, J., 427 
Stanley, R., 440 
Stanton, D. and R., 440 
Stirling number, 275 
Stirling’s approximation forn!, 198 
Street surveillance, 6 
Street sweeping, 52 
Strongly connected graph, 45 
Subdivided edge, 35 
Subgraph, 15, 437 
Subgroup, 362 
Subset, 415 
proper subset, 415 
Subtree, 94, 438 
Successor of a vertex, 388 
Summation methods: 

using binomial identities, 230 
using generating functions, 277 
using recurrence relations, 305 
Sylvester, J., 44, 351 
Symmetry: of a geometric figure, 356 
in a relation, 356 

T 

Takeaway games, 385 
Tetrahedron, symmetries of, 358 
Tournaments, 44, 62, 73, 102, 297 
Tower of Hanoi game, 286 
Trail, 47, 49, 437 
Transitive closure, 130 
Transitivity in a relation, 356 
Transportation problem, 1 64 
Transportation tableau, 165 
Transposition in a permutation, 363 
Traveling salesperson problem, 113, 
125,431 

Traversal of tree, 118 

inorder, postorder, preorder, 118, 438 
Tree, 93, 437 
binary, 95, 438 


m- ary, 95, 438 
rooted, 93, 438 
Tree sort, 125 
Trial of experiment, 424 
Triangulation of a graph, 78 
Trigraph, 402 

Trigraph frequency table, 402 
Tripartite graph, 41 
Tutte, W„ 87 

U 

Union of sets, 416 
Unit-flow chain, 142 
Unit-flow path, 139 
Universal set, 415 

V 

Value of a flow, 137 
Vandermonde, A., 86 
VanLint, J„ 440 
Venn diagram, 320, 417 
Vertex, 3 
Vertex basis, 10 
Vizing’s theorem, 80 
Voter power, 1 97 

W 

Wallis, W„ 439 

West, D„ 44, 440 

Wheel graph, 70 

White, D„ 440 

Whitworth, W., 237 

Wilson, R„ 44, 86, 440 

Winning position in a game, 386 

Winning strategy in a game, 386 

Winning vertex, 386 

Woods, J„ 440 

Y 

Yellen, J„ 440 


